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MEMORANDUM 


If the modem student, who aspires to be an Actuary, is faced with 
a wider range of actuarial subjects and a more searching test of his 
understanding of actuarial principles than was the student of an 
earlier generation, he has this compensation — that the means to 
acquire such understanding are more readily available and put into 
a form which may more readily be assimilated. 

This book is such an example of the help afforded to the actuarial 
student, and he is fortunate to have the opportunity to study a uork 
of this merit at the very beginning of his Course. A complete grasp 
of its principles will not only assist him to satisfy his examiners as 
to his proficiency in the mathematical problems by which he will be ' 
tested, but when he has so satisfied them he will have laid such 
foundation for his future work that he will ever have cause to be 
grateful to the author of this treatise. 


H. J. P. O. 



NOTATION 


Pn(x) a polynomial (i.e., a rational integral function) of degree 
« in X. 


«(r) 


„(r) 


n (n- i) (n-z) . ..(n -r+ 1) _ n<'’> 
r(r-i)(r-2)...3.2.i “ r! ’ 

or negative, integral or fractional. 


where « may be positive 


This is represented in other wforks by the symbols ^ ^ 
n^; see y.l.A. vol. t.xiii, p. 58. 


« (n - 1 ) (« - 2) . . . — r + 1 ). 

There seems to be no recognized symbol for the more 
general factorial n {n — h)(n — 2h) ...{n-r— \h). It may 
sometimes be convenient to represent this by the same 
symbol n^'\ but in that case the symbol must be specially 
defined and consistently used. Cf. post, p. 19. 


{n+i){n + 2)...{n + r) {ti + ry’’ 

A different notation is employed in certain other works — 
see post, p. 19. 

the divided difference of order t. This notation is e.vplained 
**■■■ and reference is made to other notations in Chapter III. 

Paragraphs and examples marked with an asterisk, thus *, are 
intended for the advanced student only, and need not be read by 
students preparing for Part I of the Institute Examinations. 


REFERENCES 

J.I.A. Journal of the Institute of Actuaries. 

T.F.A. Tfansactions of the Faculty of Actuaries. 

J.S.S. Journal of the Institute of Actuaries Students' Society. 



AUTHOR’S PREFACE 

As explained in the Preface to Part I of this book, the revision of the 
syllabus for the Examinations of the Institute of Actuaries rendered 
it necessary to divide An Elementary Treatise on Actuarial Mathe- 
matics into two parts and to make additions to that part of the book 
intended for students who were reading for the first Part of the 
Examinations. The opportunity has been taken to revise thoroughly 
the Chapters on Finite Differences and Probability and to bring the 
book up to date. While little alteration has been made in a few 
Chapters of Actuarial Mathematics the majority have been entirely 
rewritten, and it is therefore advisable to refer to each Chapter of 
the present book in detail. 

Chapters I and II do not differ markedly from the first two Chap- 
ters on Finite Differences in Actuarial Mathematics. Chapter III, 
on Interpolation with Unequal Intervals, has been rewritten. The 
introduction of Dr Aitken’s notation has simplified the general pro- 
blem of divided differences, and as a result a more logical exposition 
of the theorems connected with these differences has been given. 
The earlier parts of Chapter IV, on Central Differences, have been 
remodelled, and much new matter inserted. With regard to Chap- 
ter V as given in Actuarial Mathematics it was thought that the 
treatment of the principles of Inverse Interpolation w as to a certain 
extent incomplete. Dr Aitken suggested lines on which improve- 
ment might be made and the initial paragraphs have therefore been 
redrafted. Considerable alterations have been made in the Chapter 
on Summation: the first few paragraphs have been recast, a new 
section dealing with the application of operators to the summation 
of algebraic series has been introduced and paragraphs on the 
operator [«] have been inserted. Chapter VII, on Miscellaneous 
Theorems, has been enlarged by the introduction of new material 
and a fuller treatment has been given of the matter in the previous 
book. Chapter VIII is entirely new-. It was considered that there 
was now an opportunity to collect in one Chapter a number of 
modem methods and special devices which have recently appeared 
from the pens of Mr Lidstone and Dr Aitken, but which are scat- 
tered over the pages of various Journals. The Chapter on Approxi- 
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mate Integration has been amended in order to clarify certain proofs 
and theorems, but is otherwise unaltered. 

In view of the introduction of Elementary Statistics into the 
syllabus for Part I of the Examinations it was considered advisable 
to approach the study of Probability from a slightly different angle. 
As a consequence, Chapter X has been largely rewritten. New 
Chapters on Elementary Statistics have been introduced: these 
Chapters do not deal exhaustively with the subject, but are, it is 
hoped, sufficient to enable the student to solve simple types of pro- 
blem met with in practical work. Little alteration has been made 
in the Chapter dealing with Mean Value and the application of tlie 
Calculus to Probability. 

The Examples are mainly those which have already appeared in 
Actuarial Mathematics. Examples ii and 12 contain exercises in 
Statistics and the Miscellaneous Examples contain many new 
questions. 

In the preparation of this Part of the book I have received much 
help from various sources. I have had access to the notes written 
for the Actuarial Tuition Ser\'ice and some of the Lessons on 
Probability and Statistics (prepared originally by P. M. Marplcs) 
have proved of value. My thanks are due to Mr H. Tetley and 
Mr O. C. J. Klagge for helpful suggestions on certain points. 

It is, however, to Dr A. C. Aitken and Mr G. J. Lidstone to whom 
I am most indebted. Dr Aitken generously put at my disposal his 
notes on various subjects connected with Finite Differences. In 
particular, his suggestions on Inverse Interpolation, the divided 
difference notation and the methods of cross-means have been such 
that I was able to adopt them with little alteration. 7 'hroughout the 
preparation of the book, from the earliest stages to the final proof- 
reading, Mr Lidstone has been my constant guide and mentor. 
A great deal of the work has been inspired by him and his comments 
and criticisms on all points have been invaluable. He has read 
through and annotated the whole of the book with that thoroughness 
for which he is justly famous, and I am deeply grateful to him for 
his help. 


March 1939 


H.F. 



FINITE DIFFERENCES 


CHAPTER I 

DEFINITIONS AND FUNDAMENTAL 
FORMULAE 

1 . The function y = a->rhx + cx~+ ...+kx^ is a rational integral 
function of the «th degree in jc, where the indices are positive 
integers, n being the greatest, and a,b,c ... k are constants, of 
which ^ + o but the others are unrestricted. 

A rational integral function is also called a polynomial, and it 
is convenient to represent a polynomial of the nth degree as 

Consider the simple polynomial j = = i -v -4- It is quite 

easy to obtain the value of y corresponding to any value of .v by 
substituting that value of .r on the right-hand side of the equation. 
For example 

A 012345678 
.V I 3 7 >3 21 31 43 57 73 

It will be found that for successive integral values of x in the 
above table the values of y have interesting properties. If from 
each value of y the previous value of y be subtracted, we obtain a 
new set of figures; 

(a) 2 4 6 8 10 12 14 16 

and if the subtraction be performed on these figures in the same 
way the new differences are 

((3) 2 2 2 2 2 2 2 

The sequence of 2’s in (/3) is not a mere coincidence: it will be 
shoum later that when y has the value supposed all the terms in 
(/3) have the same value, 2, however far the series extends. 

This leads us to another method of obtaining values of y. Sup- 
pose that we write down tlie original table in a different form, and 
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include in the table the two seta of figures (a) and (^) thus: 

(«) 03 ) 


" V' 

0 

1 

2 

3 

4 

5 

6 

7 

8 


y 

I 

3 

7 

13 

21 

31 

43 

57 

73 


2 

4 

6 

8 

lO 

12 

H 

i6 


2 

2 

2 

2 

2 


We can now find any further value oiy by extending the columns 
(a) and (/3). We must however work from (^) to (a) and then to 
y instead of from y to (a) and then to (/3) as has already been done. 
For example, to obtain the value of y when x has the value 9, i.e. 
to obtain a new 2 must be inserted in the (/S) column: the new 
value in the (a) column will be 16 + 2= 18, and the required value 
of will be 73 + 18 = 91. To find Wjo the process is continued. Any 
value of y corresponding to an integral value of x can be obtained 
in a similar manner. 


2. The above is a particular instance of a far more general set of 
operations. We have used the simplest possible numerical values 
of X, namely the natural numbers, and we have evolved our 
example from a known quadratic function y = Uj.= i+x-\-x'^. As 
a general rule the form of the function is not known and the given 
values of * are not necessarily consecutive integers. 

3. Now suppose that instead of numerical values of x differing by 
unity we have the following consecutive values of x : 

a, a+h, a + zh, 0 + 3 ^, ..., 
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where the values of x differ by a quantity h instead of by 
unity. 

Then if the function be still y = u^ the values of y corresponding 
to the above values of x will be 

*^tt+2A* **a+3A> • • • • 

In order to form a column similar to column (a) above we shall 
have to write down 


^a-^h ^a+h* ^a+3A ^a-t-2A 

These are called the first differences of the function y = u^ and 
are denoted by . . . 

where A is not a quantity but a symbol representing an “operation”. 
Column (^), being the differences of column (a), will be 


(*^a4-2A ^a-^h) i^a+h 

{*^a+3A “ ~ (*^o+»A ~ 


or, more shortly, 




These are called the second differences of and are denoted by 
A Ug, ^ A Un+aA •••» 

where, it must be emphasized, the symbol A^ does not represent the 
square of a quantity but denotes the repetition of the operation A. 

Similarly, third, fourth, ... nth differences, formed in exactly the 
same way, arc denoted by 

A»u„, AX ... A"m„. 

4. Before forming a difference table similar to that in paragraph i, 
it is convenient to introduce alternative names for x and y in our 
equation jv = Where our ultimate object is to obtain numerical 
values of x or y, the independent variable is often termed the 
argument, and the corresponding value of y the entry. 
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In a table of logarithms the number itself is the argument and the 
logarithm the entry. The converse holds in a table of antilogarithms, 
where the logarithm is the argument. Similarly in a table of sin a, a is 
the argument and the sine the entry, whereas a is the entry in a table of 
sin”* a. 

5. Our new difference table is therefore 




First 

Second 

Third 

Argument 

Kntry 

differences 

differences 

differences 

a 


Au„ 



a + h 

^a+h 

AX 




A 


A3«„ 

a + 2 h 

42A 

*^*^04 271 


•^®«a+A 

a + 2,h 

**a+3A 

( »A 

AX.2A 

f 2A 

a + ^ 

*^a+4A 

* lA 



a + ^h 

*^a+6A 





The first term in the table («<,) is called the leading term, and the 
differences which stand at the head of the respective columns, 
namely.A« 3 , A®^^ ..., are called the leading differences. 


6. .Although we have expressed the terms in the difference table 
by the use of A symbols, it is quite easy to obtain any difference 
in terms of the function alone. 

For example, A®«„ is the difference between and A*j/„, 

or AX = ^X+a-*^X- 

Again, A’^u„ is the di.'ference between Aw„,^ and A«q, or 
A*Ug — Au„^;, Au„, 
and as Au„ = u^^^-u^, 
we have AX = AX+a - 

= (*^«a42A - “ AwJ 

= '^«a42A-2AM„+h + A«„ 

= ("a+SA - «a+2A) “ 2 («a+2A ' “a+h) + («o+A ' “a) 

■= “a+iJA - 3“a+2A + 3««+A - “a- 



DIFFERENCE TABLE 5 

7. It is a simple matter to construct a difference table from a given 
set of data. 

Consider the following examples : 


Example 1. 

Construct a difference table from the following values, where is a 
function of x: 


X 

y 

Ay 

AV 

A“y 

1 

1 

7 



2 

8 

19 

12 

6 

3 

27 

37 

18 

6 

4 

64 

6t 

24 

6 

5 

125 

91 

30 

6 

6 

216 

127 

36 


7 

343 




Example 2. 





.Show that, in 

the following table of annuity-values, third differences 

arc practically constant : 




Argument x 

Imtry tij. 

An, 


A»u, 

35 

14-298 

--•54 



36 

14-144 

-158 

— *004 

- 4 - -001 

37 

13-986 

— -161 

— 'OO3 

-000 

3 « 

13-825 

— -164 

-•003 

-f -001 

39 

13-661 

— -i66 

' *002 

-f -001 

40 

13-495 

— -167 

— -001 


41 

13-328 





It will be observed that in Ex. i third differences are invariably the 
same. In the second example, however, third differences are not quite 
constant, although the error in assuming them to be so is very small. 
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The difference in the two examples lies in the fact that, while the 
first function is j = the table of annuity -values from which the data 
in the second example have been taken does not conform to a simple 
mathematical law and, further, the values do not naturally terminate 
with the third decimal place, but are rounded off at that place. 


Example 3. 

Assuming third differences constant, find the values of and from 
the data: 


X 

4 5 

6 

7 

8 


•35 -88 

1-71 

2*90 

4-51 

Construct the difference table from the given values, and fill in the 

vacant spaces 

in the A ^ Uj . column with the 

constant 

third difference. 

thus: 





X 




A’u. 

2 

- 05 

•11 



3 

+ 06 

•29 

•18 

•06 

4 

+ -35 

•53 

•24 

•06 

5 

+ -88 

•83 

•30 

•06 

6 

+ I-7I 

119 

•36 

•06 

7 

4 2 90 

i-6i 

•42 


8 

+ 4-51 





8. Now it has been stated above that a convenient method for 
expressing the difference between two successive values of a 
function and u„ is by the symbol A prefixed to so that 
AUo = Ma+ft-Ua. It will be seen therefore that to find Am„ wt per- 
form two operations: we change «„ to and subtract from 
it. The new function resulting from the first of these opera- 
tions is denoted symbolically by and the double operation 

may be written , „ 

^ Au„ = Eu„-u„. 

This gives Eu^ = “a + Au^. 

EUa may therefore otherwise be expressed as the sum of and 
its first difference. 
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Suppose that in either of the above relations the u„ which occurs 
in each of the terms be omitted. Then we can state that the two 
operations denoted by "E" and “A” are connected by the 
symbolic identity £— i+A 

It must be distinctly understood that we have not “factorized 
out” in the relation £«„ = u„ + A«„, and that we must relate the 
symbols to the functions on which they operate. If, therefore, we 
were using the equivalence AsA’— i, and we operated on the 
function sin x, it would be wrong to say that A sin ac = F sin x — i. 
'I'he correct statement is Asinx = £sinx — sin x. When we are 
dealing with symbols of operation we cannot treat any of them as 
quantities, and on forming the algebraic or trigonometrical identity 
the function must be included in all three terms. In other words, 
in the identity Zi = i+A the i is a symbol of operation just 
as are E and A, and its meaning is that the function on which it 
operates is to be taken once without alteration. 

9. In the same way as A^ denotes, when operating on a function, 
the difference of the difference of the function, i.e. the second 
difference, so denotes the operation of repeating E. That is to 

£'*m^ = £'.£'«x = -£'«i+a = «x+2i.. 

and, generally, E^u^ = 

Care must be taken not to confuse the expression Ehij^ with 
{Eu^Y- For example, 

(X*) = (,v + 2hf = .X* + 4 Ax + 4 A*, 
but (fix)* = (x + A)* = X* + 2/1X + A*. 

10. It is evident that the first difference of a function of the form 
cx, where c is a constant, is constant: for Afx = c(.x + A) — cx = fA, 
which is constant. 

Let us consider the effect of differencing a function of x of 
higher degree than the first. 
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Example 4. 

Difference successively the functions (i) y = bx* and (ii)_y = ajc*. 

(i) Abx^ = b{x + hy-bx^“ = zbhx + bh\ 

A^bx^ = A (zbhx + bh^) = zbh (* + A) + bh^ — zbhx — bh^ = zbh^, 
and since zbh^ is constant all higher differences will be zero. 

(ii) Aa!^ = a (jc + A)® — ax^ = -^ahx^ + ^ah'^x + ah^, 

A^ax^ = 6ah^x + 6ah^, 

and A^ax^ = 6ah^, higher differences vanishing. 

Collating the above results, we have that 
the first differences of functions of the form cx are constant, 
the second „ ,, ,, bx^ „ 

the third „ ,, ,, ,, 

It follows therefore that third differences of ax^ + bx^ + cx + d 
are constant, for before we reach the third differences the tenns 
bx^, cx and d will have been eliminated. 

11. The above considerations lead us to the following important 
proposition : 

If Kj, be a polynomial of the nth degree in x, then the nth 
difference of the function is constant. 

Let the function be 

Uj. = ax" + bx"~^ + cx"~‘^ + . . . + r ; 
then Au^ = a (x + h)" + b {x + h)"~^ + c (x + h)'‘ ~ + ...+s 

— ax" — bx"~^ — cx"'^ — ... — r 
= anx"~^h + b'x”~^ + c'x"~ ® + . . . + r ' , 

where b', c , ... r' are coefficients involving h but not *. 

Similarly, 

A*Mj. = an(n-i) + b"x"~^ + c''x" *->r...+q", 

and so on. 

Each time that we difference we lower the degree of the function 
by unity. After differencing n times no terms after the first will 
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appear, and we shall be left with 

(n - 1 ) (n — 2) (w — 3) ... 2 . i . A" or a« ! A", 

which is independent of x and is therefore constant. 

As a corollary we may note that A"+*m^ = o, a property of a 
polynomial of the nth degree which is of value in the practical 
application of the work. 

The converse proposition is of importance : if the (n + i)th differ- 
ence of a function is the first to become zero, the function is a 
polynomial of not more than the nth degree. 

12. It should be remembered that we are dealing here w'ith a 
particular form of function. Should the function be other than 
a polynomial the nth difference will not vanish however great n 
may be. Thus, we have 

Example S. 

Find the nth difference of e'. 

Ae* = =e^ (c'* — i ), 

A*e^ = — I ) (e^ * — c') = e* (e* — O’®. 

Similarly, = e-' (e* — 0^. 


Generally, A'‘e* = «■' (f'‘ - i)". winch is still a function of x, and is 
therefore not constant. 

13. Although it has been said that the symbols A and E are in no 
sense algebraic quantities, our definitions, namely that A" denotes 
the operation of differencing the function n times, and that £" 
denotes the operation of obtaining a new function when the argu- 
ment is increased by n unit differences, enable us to apply to these 
symbols the ordinary algebraic laws. For example, 

A (Ug. "i" Wy) I* + h tly, 

which is AMj.-!- AW y. This relation is exactly similar to the ordinary 
algebraic identity 3 {x -i-y) = 3X + 3y. 
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The three simple algebraic laws are the laws of (i) distribution, 
(ii) commutation, (iii) indices. 

(>) + + 

= i«x+h - “ J + (^x+h - ^x) + (“’x+A - «>x) 

= Aux + Av^ + Aw^+.... 

Similarly, 

E {Ux + v^ + w^ + ...) = Eu^ + Ev^ + Eio^ + . . .. 

(ii) The symbols A and E are commutative in their operation as 
regards constants. For if r be a constant, 

Acu^ = -cu^ = c - u^) = cAm^, 

and EcUj. = cu^_^,j, =cEu^. 

(iii) The application of indices to the symbols A and E may be 
shown thus: 

If m be a positive integer, then A*" represents the operation of 
differencing m times. 

A^m^ = (AAAA ... m times) u^, 

A” (A"*!/,) = (AAAA ... n times) (AAAA ... m times) 

= (AAAA ... m + n times) 

= A’"+"7/^. 

Similarly, E”‘u^ = u^^^,„ 

E> {E^U^) = mA = mh+nh = 

14. In connection with the law of indices we must be careful to 
define A"*, A”, . . . when m and n are not positive integers. So 

far, the symbols A™ and E'^ are intelligible only when we can 
actually perform the operations defined above and obtain the values 
of the new functions. We have not yet defined these symbols when 
the indices are negative. Consider for example the symbol A~^. 
Since we have assumed that the symbol A obeys the ordinary 
algebraic laws, A“^ must be such that A {A~^u^ gives A^u^, i.e. u^. 

Let m be a positive integer. Then w'c define A^^'u^ as a function 
such that if it be operated on by A"* the result will be A^'^u^, 



PROPERTIES OF OPERATORS 


II 


i.e. Uj.. In the same way we have a meaning for namely, 

that operating on produces u^. But if m be a positive 

integer, £> operating on produces i.e. u^. Therefore 

the same result is obtained by operating with J?"* on E^’^u^ as on 
In other words just as E’"u^ gives so E-'"u^ gives 

The symbols E and A may be manipulated in a manner similar 
to algebraic quantities provided that it is always remembered that 
they are operators and that they have no actual values. There are, 
however, two important points in which algebraic precedent cannot 
be safely followed. These are; 

(1) Operators are not commutative with regard to variables. 

E.g., A (Uj-i’a.) does not as a rule equal MjAi^j.. 

(2) It is fundamental in algebra that if a function vanishes, then 
one of its factors must vanish. It is not true that if the result of a 
series of operations on u^. is equivalent to o.Uj. (i.e. zero), then 
some one of the operations on must produce o . u^. For example, 
if x^ — o, then x = o\ it does not necessarily follow, however, that 
if A* = o, then A so. 

In many problems it is convenient to use operators alone and to 
omit the functions on which they operate. Where this practice is 
followed the sign s (is equivalent to) should be adopted in place 
of = (equals). Thus, Eu^ = {i +A) u^., but £'s(i +A). 

For further information on the difficulties connected with the use of 
operators the student may refer to J.S.S. voJ. ii, pp. 237 et seq. (S. H. 
Alison). 


16. Proceeding from the definition of differencing, it has been 
shown that 

= Uj, + AUj. + A (u^ + AWj.) 

= Mj, + 2 AMj. + A*Hj, 


+ JA d" Am 

= Mj. + 2 Am, + A*M^ + a (Uj. + zAw^ + A-mJ 

= M, + 3 Am^ + 3 AX + 


rh 
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The coefficients of the various terms in these expansions are the 
coefficients of x in the expansions of (i+x), (i+x)*, (i+x)* by 
the binomial theorem. If we assume, for positive integral values of 
n, that the general relation between and and its differences 
follows the same law, we can prove the truth of the assumption by 
the method of mathematical induction. 

Assume therefore that 

Ux+nh = + • - • + -r . . . + ^’'U^ 

is true for the value n. 

Then, since «x+(n+i)A = «x+nh + 
we have 

“;c+(n+l)A = “x + ”a>^“x + ”(2)‘^'‘Wx + + ”(r)'^'’“x‘'' • • + ^"“x 

+ A (u^ + «(!) Au, + «(2) A^Uj, + . . . + M(,, A’-«j, + . . . + A'*u,) 

= “x + A«x (%)+ + (”(2)+”a>)+--- 

+ A'l/^ (n (, , + « ,) + ...+ A’‘+ 

But «(r) + «(r-l) = (”+l)(r). 

therefore 

«x+(fl+l)ft = Kx + (” + l)(l)-^“x + («+0(2)^^«x+---+(”+0(r)A’’Mx 

4 - . . . + A"+>«„ 

which is of the same form in (« + 1) as was the original expression 
in ft. 

Therefore if the assumption is true for n it is true for n + 1 . 

But the theorem holds when n= i, 2, 3. 

Therefore it is true when 71 = 4, 5, ... and for all positive integral 
values. 

Therefore, for positive integral values of n, 

«x+nA = «x + «( 1 ) A«x +«( 2 ) A^«x + «(3) AX + . . . + «(,)AX + • ■ • + A"U^. 

16 . When the relation between the operators A and E was dis- 
cussed it was stated that our definition of these operations enables 
us to apply the ordinary algebraic laws to these symbols. We may 
therefore use the equivalent relation 

£■= 1 + A, 
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and if we operate on the function we shall have 

= (i+W(j)A + n(2^A*+..-+W(r)A’’+...+A") 

If we introduce the fact that the symbols follow the algebraic 
distributive Ipw, we may write 

"x+nft = “x + ”(1) Aw* + >1(2) AX + ■ • • + W<r ) A^W* + . . . + A"U^, 

which is the relation proved above for positive integral values of n. 

This result is true whatever the form of the function so long as 
n is a positive integer. If n be other than a positive integer we 
cannot adopt the binomial expansion without further investigation. 
For the purposes of this chapter it will be sufficient to assume that 
the relation £’" = (1 +A)'‘s i +n(,,A + «, 2 )A^ + n( 3 )A®+ ... holds 
without restriction. The conditions of the validity of the expansion 
will be discussed at a later stage (see Chap. II) 

17. We are now in a position to state that if «+i consecutive 
values of a polynomial of the nth degree are given, then, by tlie 
method of finite differences, we can obtain the actual function in 
the form 

u* = Wo + Aw„ + .V( 2 ) AX 4- . . . + (o) AX. 

or Uj. = A + Bx(i) + Cx( 2 ) + . . . + Kx ^„ ), 

where the coefficients A, B, C, ... K are obtained by inspection of 
a table of differences. 

Now if we are given n + i corresponding values of x and w^. it 
does not immediately follow that Uj. is a polynomial of the nth 
degree. 

For example, suppose that we have the following data: 

X o I 2 3 4 s 

Mj. I 4 9 16 25 36 

Since six values are given there are the following possibilities : 

(i) they may be actually given as values of the function (j + x)®; 

(ii) they may be given as values of a polynomial of the second 
degree in x, and it may be required to find the function; 
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(iii) they may be given as values of a polynomial of the nth 
degree, where n is less than 6, and it may be required to find the 
function ; 

(iv) they may be given as values of a polynomial without any 
information as to degree ; 

(v) no information regarding the nature of the function may be 
available. 

The answer to (ii) and (iii) is obviously u^ = {i 4 -x)*. 

The answer to (iv) is u^ = (i +x)* + ^ x (x— i) ... (x— 5) or 

(i +x)* + X(g)F^, where F_j is a polynomial in x which does not 
become infinite at any of the points o, i, 2, 3, 4, 5. The function 

^,x(x~i) ... (x-s)F^ 

will then obviously vanish for these values. 

The answer to (v) is the same as to (iv) except that need not 
be a polynomial. 

It is of importance to realize that we can always find a value for 
Fj. which will make the function = ( i + x)* + X(8) F^ agree with any 
additional value whatever. For example, if u^.^ = i9’75 the function 
«^ = (i +x)^ + X(g)2® will agree with the given values and also with 
the additional value which has been inserted at the point x = 4-5. 

Conversely we can say that whatever be the complete form of 
the function of which the six given values are samples, the value 
at any other point is the value of the function (1 +x)^ at that point 
with an error X(g)Fj.. Whether the value is a good approximation 
or not depends on the magnitude of F^., and we may or may not 
have reason to suppose that F^ is so small that it can be neglected. 
It should be understood that we are not at liberty to say that 
(i +x)® gives an approximate value at the point in question unless 
we can give such a reason, based either on theory or on experience. 

The matter is further investigated in later paragraphs, but it may be 
said that in most practical cases is of the same order of magnitude 
as A"+h/* for some value of x in the range under consideration. It may 



EXPANSION FOR 


IS 


in fact be shown that = ^ where f is a quantity falling in the 

range which includes x and the given values of u. (See Chap. Ill, 
paragraph 17.) 


18 . If instead of writing +A)" and expanding this by the 

binomial theorem, we write A"s(£— i)’* and expand, a new series 
is obtained: 

AX = (^^-i)"«r 

= [£" - + n(2)£'"-* - W(3) +...+{- I )'• 

1 ) A ^(2)*^x+(rt— 2) A ~ - 

+ ( - O’" «(r)«.r^(n-r)fc + ... + (- l)" «x- 

Just as the relation established in paragraph 15 enables us to 
obtain the value of in terms of and its leading differences, 
so the above relation gives any required difference of the function 
Uj in terms of successive values of the function. 

19 . A few simple illustrations of the use of these two formulae are 
given below. 

Example 6. 

Find ttg, given Wp=-3, u^ = 6, U2 = S, u,= i2; third differences 
being constant. 

The leading differences are easily found to be Ano = 9; —7; 

A\ = 9. '■ 

Ug = (t +A)® Uo = (i +6A + i5A* + 2oA’) Up ^ , 

= «„ + bAup + 1 5 A-u„ + aoA’up 
= - 3 + 54 — 105 + 180= 126. 

Note. There is no need to continue the expansion beyond third differ- 
ences, as further differences are zero. 

Example 7. 

Find ttj, given iq = o, Ug = 3, u, = 9 ; second differences being constant. 
Here the initial term of the known series is Uj, so that in order to find 
Uj we must use the relation 

“» = «i-» = = ( I + A)-* «* = ( I - zA + 3 A^) u^, 

as far as second differences. 
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M2 = tt4-2A«4 + 3A^M4 
= o -6 + 9 = 3 , 

since A «4 = 3 and A *//4 = 3 . 

Example 8. 

From the following values of u„ calculate A^Uq: 

«^z=3, «j=i2, U2 = 8 i, 1/3 = 200, U4 = ioo, «b = 8 . 

Since we require only one value of A^u^, we do not need to form a 
difference table, but may write at once 

AX = (£'-i)*f/„ 

= — 5E* + loA’^ — loE^ + 5/i - 1 ) Wfl 

= - sF^(/o+ io£*«„ ~ ioE~Uq + 5A'ji„ - Uo 


= “5-5“4+ I0“S- IOW2 + 5(ii-I/o 

= 755- 

A^ole. Before we can find the fifth difference six terms of the serjes 
must be given. 


20. Separation of symbols. 

In obtaining the value of E’^u^ in terms of and its differences 
we have used the symbolic relation F= i + A and have expanded 
(i +A)" by the binomial theorem without introducing the function 
1 / 3 . until the last stage. This method, in which in fact is omitted 
from both sides of the identity, is known as the method of separation 
of symbols, and enables many relations involving and differences 
of to be readily established. It must however be remembered 
that the operators cannot really stand alone and that the operand 
is alw'ays understood. 

Example 9 . 

Show that 

J/0 + ttj+I/2+...+U„ 


= (« + I )( 1 ) «0 + (« + 0(2) Amo + (« + l)( 3 , A*Ho + . . . + A"«„. 



SEPARATION OF SYMBOLS 


= Mo4-£'«o + J?X+ ■■• + ^"“0 

-{\+E + E^-\- ■■■+ •£■'') Mo 

^W+1 __ j 

= Mo, or substituting i + A for E, 

E— 1 

(n-A)"+i-i 

= 

— ^ [1 +(”+ 0(i)'^ + ("+ + (”■*■ J)(3)^° + ■•■ +A"+* — i] Mo 

= [(” + l)(l)+(”+ l)(2)'^ + ("+*)(3)'^^+--- +A"] Mo 

= (« + I )(1) “0 + (” + I )(2) + (« + I )(3) + • ■ • + 


Example 10. 

Prove by the method of separation of symbols tliat 

Mj = Mj._i + AMj_j + A-Mj_3 + A“Mj._4 + ... + A"“’Mj._„ + A”M3._„. 


I /A\’‘i 

«X - A"m^_„ = m^ - \’‘E~”u^ = ■! I - I- M, = 

( \£./ 


£" - A" 


J 




I |£"-A") 




since £■ — A = i, 


E” ( £-A 1 
= fe'-” (£'"-1 + A£’’'-* + A'^£’"-» + . . . + A"-") K, 
= (£-' + A£'-2 + A'=£'-» + . .. + A’‘-'£-") 


= f Amj_2 + A'‘Mj._3 + ... + A"“^m^_„. 

.-. M, = + AM^_a + A-m^ _3 + ■ . . + A"-iu*_„ + A"m^_„. 

Since this is true for all values of n we have the convenient formulae 

) 

Mj. = Mj_j + Att^_j (which is otherwise evident), 

Mx = «x-i + Am^_j + A2m^_2, 

«i = M^_i + Am^_j + A^m^_3 + A®m^_3, 

and so on. 


Example 11 • 

Obtain a formula based on m„ similar to that given by the relation 
£*Mo = (i+A)* a,. 


r M A s 
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l8 


E’‘u, 


" U„ = 1^-^' ^ j since £ - A = 


'E 


= (i -A£-i)"-* i/„ 

= [i - (n - x),!, A/i-i + (n - x), 4 ) A==£-2 

«, = w„ - (« - Aw„_i + (« - A.), 2, AX-! - ■ • ■ • 


It will be found that this is an ordinary formula which could be 
obtained by using the values in the reverse order u„, m„_i, «„_ 2 , ••• «(,• 
There is as much justification for using one order as the other. It 
should be noticed that the same numerical values appear in the table of 
differences, but that they are in the reverse order with a change of sign 
for the odd differences. This should be tested by a numerical example. 
If *> w, we may use the general relation 

and write the formula as 


u^ = u„+(A-«)„,An„_, + (A-n+ i),g,A*//„.8+ .... 
where the coefficients are positive and are those in the expansion of 


Example 12. 

Find the value of 

Aa™ - i A^a™ -(- A^a™ _ ' • 3 • S ^4^.m + . , , to m terms. 

2-4 2.4.6 

Since A™+^a™ and higher differences of a"* are zero, the sum of the 
series to m terms is the same as the sum to infinity. 

Omitting the function a™, and working on symbols alone, we have 

A_ia»+‘'3 A3-‘'3| (i-,iA + ---^ A»- A® + ...\ 

2.4 2.4.6 V ■ 2.4 2.4.6 ' 

= A(i-JA+yA*-^^-^A=‘ + ...j 

= A (i+Ari = A£-i. 

The value of the given series is therefore 

Air^A™ = A (a - D™ = (a + .})"* - (a - i)”*, 
if the interval of differencing be taken as unity. 
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Further examples of the application of the method of separation 
of symbols to the operators A and E and to other operators will be 
found in Chapters VI and VII, 


21. Factorial notation. 

For many purposes it is useful to use a notation for the product 
of m factors of which the first is x and the successive factors decrease 
by a constant difference. 

Generally, 

x<f^^~x{x — h){x — zh){x — '},h ) ... (x — tn— ih). 

For convenience in working we shall take h=i. Fhen, if 
jfim) = * _ I ) _ 2) (jc - 3) . . . (x - W2 - I ), 

Ajc(”‘> = (x+ i) x(jc— i) ... {x — m — z)~x{x— \ ){x — z ) ... (x — m — i) 

= inx (x — i) ... {x — m — z) 

_ 

Similarly lZ^x^”' ’ = m (m - i ) 

and, eventually, A'”x<’"' = m!. 

Again, from the definition of 

= (a; — + I ) 

.'. when rw=o, a;*”' = (x: + i ) 

By convention, a:'“' = i. 

i-(x+i)x^-'^ 


or 

When m = — I 


so that 


x+ I 


.V ■ ** I 

X + 2 {.v+ i) (.r+ 2)‘ 


Generally 


m) _ 


{x+ 1 ) (x + z) ... {x + m) {x + m} 


m)* 


This notation, adopted by Aitkcn and Milne-Thomson, differs from 
that used in Boole’s Finite Differences, Steffensen’s Interpolation and 
Freeman’s Actuarial Mathematics. It has the advantage that, for any 
value of X, the relation 
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is valid for all integral values of r and m, whether negative, zero or 
positive, provided that division by zero is not involved. 

By proceeding as above it can be shown that 

{m + i) 

and so on. 

It should be noted that the result of differencing is to 

increase the degree of the denominator, and that, as a result, 
jg constant. 

A special case of is where a: is a positive integer. We have then 

(*— i) (.V- 2) ...(* — »!+ l) = Jc!/(A:-m)! or “P,,,. 

It is also of interest to note that the result of differencing is 
analogous to that of differentiating x’“. We have 

Ax*”*’ = and Dx”‘ = 

A’"a:‘™) = otI and D” *'" = ?«!. 

Similarly we have 

A’’a-, and 
These relations are very important. 

Note. In the demonstrations above Ax has been taken as unity. If 
Ax = h, then Ax^-'"^ = A-x''"^ = m {m— 1) h'^x^"'-^', and so on. 

The general principles are the same. 

Example 13. 

Express za:^ — 3 ar^ 4 - 3 x— 10 and its differences in factorial notation. 
Let 

«j. = 2 a :® — 3 a :^ + 3 x — 10 = Ax (a: — i) (a:— 2) + Pa (a — i) + Ca + /). 
Putting A = 0, I, 2 in succession, we obtain easily that 
D=-io; C=zi P = 3. 

By equating coefficients of a^ on both sides of the identity we find 
that 2I = 2. 

2A^ - 3a“ + 3a - 10 = 2A*®* + 3A**’ + 2A**' - 10. 

Au^ = 6 a '^> + 6 a '^> + 2 , 

A^Kj,= I2A<** + 6, 

A®U*= 12 . 


and 
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22. An alternative method for expressing P„{^) in the factorial 
notation is by use of detached coefficients. By this method any such 
function can be written down in the form 

+ + ... +K 

with very little trouble. 

The principle can best be illustrated by an example. 

Example 14, 

Write down i ix* + 5X® + 2x‘^ + x — 15 in factorial notation. 

Let 

Uj.= I ix^ + 5x® + 2X- + X— + + Cx'-> +Dx'^> -hE 

= /lx (x - 1) (x - 2) (x — 2) + Ex (x — 1) (x-2) 

+ Cx (x - I ) + Dx + E, 

If we divide by x, the quotient will be 
iix® + 5x- + 2x+ I 

and the remainder —15 =E. 

Divide I ix^ + 5x* + 2x+ i by x — i : 

x-i j ux^+ 5x'’+ ?x+ I iix-+i6x+i8 

I IX*- I IX* 

i6x*+ 2x 
i6x* — i6x 

i8x+ I 
i8x— 18 

19 

Divide iix*+ i6x+ 18 by x — 2; 

x-2 iix*+i6x+i8 iix + 38 

IIX* — 22X 

38X+ 18 
38X-76 

94 =C, 

and so on. 

The above processes may be appreciably shortened by adopting the 
following procedure; 

(i) omit the x*, x*, x*, ... and work on coefficients alone; 

(ii) change the sign of the constant term in x— i, x — 2, x — 3 

so that addition takes the place of subtraction. 



22 


FINITE DIFFERENCES 


The required remainders can then be easily obtained. Thus: 


1 

I I 

5 

2 

I 


0 

11 

16 

18 

2 

II 

16 

18 

>9 


0 

22 

76 

3 

■ II 

38 

94 



0 

33 



4 

II 

71 




0 





1 1 


Ux=i IX* + 5*^ + 2^:- + *— 15 = 1 + 71 + 94.t'’’ + iQ.v'i* — 15. 

This short method is the method of detached coefficients and 
is of particular advantage in solving certain problems in summation 
of series (see Chap. VI). 

23 . It is often simpler, and in some cases more practically useful, 
to express P„{x) in terms of X(,), ... rather than in terms of 

x{T)^ ^(r-1) 'phjg easily be done by evaluating u■^,u^, ..., 
forming a difference table and then using the formula given in 
paragraph 15. Thus, using the polymomial in Example 14, we have 


X 


Ao, 

A’h, 


A*u, 

0 

-15 

19 




I 

4 

207 

188 

426 


2 

211 

821 

614 

690 

264 

3 

1032 

2125 

1304 



4 

3157 






whence + 426x'(3) + 1 88x(2) + i q-Td) —15. The arithmetic 

can be made even simpler by using m_i, Uq, u^, Ug ... and 

inserting the constant difference for the purpose of obtaining the 
leading differences of u^. In the case of the polynomial considered 
above, the constant difference will be and it is instructive for 
the student to rework the example on these lines. 
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By constructing difference tables, find : 

1. The sixth term of the series 8, 12, 19, 29, 42 

2. The seventh and eighth terms of the series o, o, 2, 6, 12, 20, .... 

3. The first term of the series whose second and subsequent terms 
are 8, 3, o, - i, o, .... 

4. The entry corresponding to the argument 3 from the table: 

X (argument) 5 6 7 8 9 10 

_v (entry) lo-i i8-i 29-5 44-9 64-9 90-1 

5. The tenth term of the series 3, 14, 39, 84, 155, 258, .... 

6. Given that y = — jc^ + a: + 10, verify by constructing a difference 

table that the value of y when jc= to is 920. Use the following values 
of x: I, 2, 3, 4, 5, 6 and the corresponding values of y. 

7. Prove that u^ = Uy + Aiu + A*«i + A^«,. 

8. Find A^u^, where i/j. = a.r* + Ax* + cx + and the interval of 
differencing is h. 

9. M, is a polynomial in x, the following values of which are known : 
lij = U3 = 27 ; tt4 = 78 ; Uj = 1 69. Find the function u^. 

10. Obtain A’® [(i — ax) (i — Ax*) (i — cx®) (j — t/x*)]. 

11. Find AflA”* and A*aA^-'. Hence sum the first ten differences 
of alf^. 

12. What are the functions whose first differences are (i) x; (2) c*; 
(3) 9x» + 3.’ 

13. Mj. = (5X + i2)/(x* + 5x + 6). Find Am;j and A*Uj. 

14. Uj. = — (x — i)“' (x — 2)“‘. Find Auj. 

15. tt, = (i2-.v) (4+x); ttj = (5-x) (4-x); tt3=:(x+ 18) (x + 6); 

114 = 94. Obtain a value of x, assuming second differences constant. 

16. Find A"Uj., where is (i) ax" + Ax"~‘, (2) 

17. Show that i/4 = «„ + 4Au(, + 6A*u_, + ioA®m_j as far as third 
differences. 

18. The first four terms of a scries arc o, 5, 16, 30. Find the sixth 
term, using the relation in Qu. 17. 

19. Find the value of A* 


L^-iT*. • 
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20. Obtain the function whose first difference is 

X* + 3*^“ + 5JC + 12. 

By means of the relation u^ = {i +A)* find 

21. ai2 given = 3 ; 14; Wj = 4o; M3 = 86; M4 =i57; M5 = 258. 

22. Me given Mo = 2S; Mj = 25; u^ = 22; m.j= 18; M4= 15; 15. 

23 . Ug given Mq = i ; Mj = 1 1 ; Mj = 2 1 ; 1/3 = 28; M4 = 29. 

24. The tenth term of the series i, 37, 61, 77 

25. The eleventh term of the series i, 4, 13, 36, 81, 156, 269, .... 

26. Prove that the rth difference of a polynomial of the nth degree 
is a polynomial of the (n — r)th degree if r < n. What happens when 
(i) r = n, (2) r>«? 

27. Define the functions x*™* and x*“™’. Obtain their nth differences, 
distinguishing between the cases when n = m. 

28. Represent the function x^— i2x® + 42x''* — 30X + 9 and its succes- 
sive differences in factorial notation. 

29. Find A"Mj. where Uj. is 

(i) (ax + b) {a.x + 1 +b) (a.x + 2 + b) ... (a.x + m + b) given m>n. 

(ii) [(ax4-i) (a.x+ i -(-6) (a.x-f-2-t-6) ... {a.x + m-^b)]-^. 

30. Obtain A sin x, A tan x and A (x-t-cos x) where the interval of 
differencing is «. 

/A" 

31. Explain the difference between ( ^ ) m^, and * and find the 
values of these functions when m^ = x*. 

32. Mi, = sinx. Show that A’'Mj, = A£'Mj. where ^ is constant. 

{ h ') 

33. Prove that A (tan~‘ x) = tan“* ! , ;}■ , where h is the 

' ' {i+xh+x^\ 

interval of differencing. 

Use the method of separation of symbols to prove the following 
identities : 


34. «jX-f MjX^-fMaX®-!- . 


_ X “1 ( I _ *)2 ^“1 + (i -xf 


35. A"«J._„ = M^ - «(!, + «,2, M^_J - n,3, M,_3 -t- . ... 

36- + X(1, Am„_i (x -h i),2, A*«„_, + (x -h 2), 3, A»m„-s + •••• 
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37. «o + *„ ) A«i + ae,j) A^a, + x,3, A»«3 + . . . 

= u^ + jc, 1 ) A=*a^_i + x, 2 , A<M,_j, + . . .. 

38 . «o+^,+ ;i + 3 , +... 


39 - “x-“x+l + “x-rS 


ao + jcAtto + ^ Ai^tto + ^- 1 AX + • ■ • 
-“x, 3+---2 A"«x-} 

40. a2„-H,i,.2a2„_i + «,2,.2Xn-2---- + { “ 2 )" «„ = ( - I )" (c-2an), 
where Uj. — ax^Jt-bx + c. 


41- «x 8 ^‘““^-1 + 8. 16'^^''^^- 8. i6r24‘^*“-'-“ 

= «xfi - 'Aa^^j + iA^aj,^ - JAX+i + ■••• 

42. Find thejTelation between a, fi, y in order that x + ^x + yx'^ may 
be expressible ip one term in factorial notation. 

43. Sum to n terms 

1 . 2Ax" - 2 . 3 A*.«" + 3 . 4A V* - 4 . 5 A 4 v” + . . . . 

44. If a, be a polynomial in x of the third degree and Aa:= i, prove 

that ^.lit 3.(3( 

«x = “u + ^Aao + -, AX+ 3 I AX- 

45. Prove that 

a„-f nii,UiX + ti|j, a2xXai3)t<3X^4- ... 

= (i +x)'* «„ + n,ii (i +A-)'‘-‘ .vAa„ + a,2) (i +.v)"-* .r*AX + ---- 

46. If n be a positive integer, prove that u„ is the dilference between 
the two series ; 

n,, , a, + (« 4- 1 )i3, AX + (« + 2)(5, A<a_, + . . . 
and (a- i)mao + n,3,A-a_, +(n+ i),5,A*a_j+ .... 



CHAPTER II 


INTERPOLATION WITH EQUAL 
INTERVALS 

1 . Interpolation may be defined as the operation of obtaining the 
value of a function for any intermediate value of the argument, 
being given the values of the functions for certain values of the 
argument. The process has been picturesquely described by Thiele 
as “ the art of reading between the lines of a table Where the form 
of the function y = is known or can be deduced from the given 
values, the ordinary algebraic process of substitution can be used 
and the required value obtained with little difficulty. In actuarial 
work the relation connecting the function and the independent 
variable is seldom simple or evident, and it is then that recourse 
must be had to finite difference methods. 

2 . Before proceeding to examine the practical aspect of inter- 
polation the question of negative and fractional values of n in the 
expression (i -I- A)” must be considered. The proof of the identity 
“x+nA — by means of operators or by induction, as in the 
previous chapter, ceases to have a meaning if n is negative or 
fractional, since the reasoning assumes that the argument x advances 
by steps of A at a time. The assumption that, so long as the ordinary 
algebraic rules are followed, the expansion is true for all values of 
the quantities involved and not only for certain specified values, 
is not necessarily true, and an analogy can be drawn between the 
application of the binomial theorem to algebraic quantities and to 
operators. For example, the expansion (i + x)" is only convergent, 
i.e. is arithmetically intelligible, for negative values of n, provided 
that X is numerically less than unity. 

Thus (i — x:)~*= I -I- 2jr-i-3Jf*-f-4Jc®-l- ... ... leads to an 

absurd result if we put x = 2, for then we should have 

(- I ■+4-H2 4-32-t- ... 
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expansion of (i + A)"u^ 
which is impossible, since 

(-l)-2=l/(-l)2=I. 

Similarly, in some cases there is no possibility of expanding 
(i + A)" Mj. by the use of the binomial tlieorem. 

Consider the two following series of corresponding values of 
X and Uj.: 

(i) .V o I 2 3 4 5 

M, 1 4 9 i6 25 36 

Then to find the value of, say, we shall have 
«/j = £;*Wo = (i+A)i «o 

= i A«o + A^Wo + . . . . 

The leading differences are Am„=3 and A*Mo=2, higher differ- 
ences being zero. 

• • «J = I + ^3 + ---^- -2 

= »+|-i = S. 

which is otherwise evident, since is (i + and, for the value 

(ii) At o I 2 3 4 5 

«x J 5 25 12 .“: ^>25 3125 

This is evidently a geometrical progression and the function 
from which the values are derived is v = Mj= 5^- If we attempted 
to express as a polynomial in .v so that 

u^ = a + bx + cx^ + d.\^ + 

and then applied the relation £'"«o = (1 -I- A)" Uj, for the value 
n = J, we should obtain as above 

Mj = £*«o=(i+A)*Ko 

~ + + ^ A*h„ 4 - .... 

Here the leading differences are 1,4, 16, 64, 256, ... and tend to 
become successively larger. But Mj = 5* = 2-24 approximately, and 
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this cannot be the same as the divergent series found by expanding 
(i+A)*Mo. 

Hence unless the function is capable of being expressed as a 
) polynomial in x we cannot use the relation E^ = {i +A)" for the 
value n = |. 

3. Let us now consider the problem more generally. If be a 
polynomial of degree k in x, we may write 

Uj. = a + bx + cx^ + . . . +px'^. 

If we adopt the binomial expansion of (i + A)" for expressing 
in terms of Uq and the leading differences of «o, namely, 

U^ = Uo + X(i)Alio+X, 2 )AX+ + ••• 

we have two series for which are equivalent for more than k 
values of the variable, since they are true for all positive integral 
values of x. 

Hence by a well-known algebraic theorem, they are true for all 
values of x, positive or negative, integral or fractional. 

Therefore so long as is a polynomial in x the binomial expan- 
sion is valid for all values of x. It is not necessarily valid for other 
forms of function, and we are led to the conclusion that we can 
expand terms of Aw^., A*ttj, ... A'^Wj. ... for all forms of 

the function if n be a positive integer, but for other values of n only 
if is a polynomial. (Cf. Chap. 1, paragraph 17 .) 

4. All finite difference formulae which are employed for the pur- 
pose of interpolation are based on the hypothesis that the functions 
in question can be represented by polynomials, i.e. by rational 
integral functions, with sufficient accuracy for the purpose in hand. 
This assumption is the justification for extending the formulae 
to fractional intervals; the processes to be explained in this and 
subsequent chapters are simply various methods of carrying out 
the calculations based on these assumptions. One special advantage 
of these methods is that it is unnecessary to fix in advance the 
degree (n) of the polynomial; the interpolation formulae will be 
in such a form that to increase n will merely involve the intro- 
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duction of a fresh term without affecting the other terms. The 
introduction of additional terms however will not necessarily im- 
prove the approximation or justify the assumption, although it will 
generally do so. Fortunately, in actuarial work the functions with 
which we deal are usually such that the assumption is sufF.ciently 
accurate for our purpose. 

5 . In applying the formula in paragraph 3 to a given set of data 
the following points should be noted: 

(a) If the basic curve is y' = a + 6x-l-c.r^-l- ... there will 
be n constants, and in order to determine these constants n equa- 
tions are necessary. For there to be n equations, values of y 
corresponding to n values of .v must be given. Therefore either n 
points on the curve, or n other corresponding relations between x 
and y, must be known. In that event the curve will be of degree 
n — I, and wth and higher differences are zero. 

(h) Our investigation has been confined to equidistant values 
of the argument. If the given values are not equidistant a formula 
slightly different in form from the expansion (i-I-A)^Mo can be 
developed with a modified method of differencing (see Chap. III). 

With regard to the statement (a) above that for a curve of degree 
n~ 1 there must be n facts given, it is not essential that n points on the 
assumed curve must be known. We may have given, for example, three 

points and two values of the differential coefficient . Here we have 

lix 

five facts ; we assume therefore a fourth degree curve, so that fifth and 
higher differences are zero. 

6. Newton’s formula. 

The formula = Mj + W(,^AHj.+ n(2)A*M,+ ... is known as 

NewTon’s formula, and is the fundamental formula for inter- 
polation when the given values are at equidistant intervals. The 
expansion can be applied to solve many forms of the problem of 
interpolation. 

The following variations of the problem may arise : 

(i) Where there are n equidistant terms and it is required to find 
an intermediate term. 
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(ii) Where there are n equidistant terms of which n — i are known 
and it is required to find the missing term. 

(iii) Where there are n equidistant terms of which n — r are 
known and it is required to find the r missing terms. 

Note. Some modern writers have adopted the name “ Newton- 
Gregory formula” for the above expansion, as the first publication 
appears to have occurred in a letter from Jm^s Gregory to John Collins 
on 23 Nov. 1670. The letter is given ana the question of Newton’s 
priority is fully discussed by D. C. Fraser xnJ.I.A. vol. lii, pp. 1 17-35. 


7 . Examples of the variations referred to above are given below: 
they are all solved by assuming the last difference constant. 

Example 1. 

The values of annuities by a certain table are given for the following 
ages; 

Age X 25 26 27 28 29 

Annuity-value a^. i6-i95 I 5 ‘ 9 i 9 iS'bjo 15326 15006 

Determine the value of the annuity at age 27^. 

Five values are given ; we must therefore assume that fourth differ- 
ences are constant. The difference table is 


X 

a. 


A»a, 



25 

16195 

- -276 




26 

15-919 

- -289 

-013 

— '002 


27 

15-630 


-015 


+ -001 

28 

15-326 

-•304 

— -016 

- 001 




-■J20 




29 

1 5 006 






The leading differences correspond to the argument * = 25 and we 
require the entry for age 27J. Our formula is therefore 

a„^=E^i a 25 = (i -t-A)*f 


. 2 5 X 1-5 2-5 X 1-5 X ’5 

I + 2-5A + — - -- A-* + — ^ A® 
2 6 


24 

= + a-sAfijs + i - S - jsA ^ a ^ + -3 i2sA«a,* - •o39o6A«a,j 

= 1 6- 1 95 — -6900 — •0244 — -0006 — -00004 
= 15-480. 


^46 
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Note. Since the data are given to three places of decimals, sufficient 
figures have been used to give three places only in the result. Since our 
interpolation is based on an assumption, namely, that fourth differences 
are constant, a result to more tljan this number of decimal places would 
be unjustifiable. 

Example 

From the kilowing data find the value of u„ : 

« 4 «=i 9 'lfc 4 ; “48=19-5356; “ 49 = >9-6513; “.,0=19-7620. 

We cannot form a difference table, since the given terms are not 
equidistant. As however four terms are available we may assume that 
third differences are constant, and that as a consequence fourth differ- 
ences are zero. 

If the function is y — Uj. we assume therefore that = o whatever 
the value of x. We may write 

= o, 

i.e. (/;■- i)* I 44 c = o, 

or {E* - + (>E~- 4^+ i) u^=o. 

i .e. 1/50 - 4«4, + 6M4,, - 4U4. + Uta = o, 

so that 197620 - 78'6 o 52 117-2136-4144, + 19-2884 = 0, 
from which u,, = 19-4147. 

Note. As mentioned above, all the given terms are not equidistant. 
The method however depends upon the fact that the term required 
makes up in all five e(]uidistant terms. 

Example 3, 

Complete the following table ; 

X 2-0 2-1 2-2 2-3 2-4 2-5 2-6 

-135 -111 -too -082 -074 

This is similar to Ex. 2. Instead of using the assumjition once that 
A^Uj. = o, we write down two equations of the same form, thus 

= 0, so that (£ - I ug.o = o, 
and A“«,.4 = o, „ (F- 1)® u,., =0. 

Our two equations then become 

“9.5 ~ 5 “ 2-4 >0144.3 — 10142.2+ S“2-I ~ “a-0 = 0> 
and Wj-o - + lotij.^ - louj.j + 5x4.2 - x,., = o, 

since the interval of differencing is o-i. 
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Inserting the known values of and solving, the required values are 
easily found to be u^-i = '123 and 1/2.4 = •090. 

Note. The function in this question is y = and the tabular value 
of 1/2.4 is -091 correct to three decimal places. This difference is due to 
the fact that our assumption that the curve y — u^. is a polynomial of 
the fourth degree in x is only approximately true. 


8. Change of origin and scale. 

If we had plotted the curve y = u^on which the values of in, 
say. Example 2 were assumed to lie, we should have had values of y 
corresponding to values of x at 46, 48, 49, 50. Precisely the same 
curve would result, however, if we changed the origin of our co- 
ordinates so that 46 tvas represented by the value v = o, 48 by x = 2, 
49 by x' = 3 and so on, the unit of .x being unaltered. This process 
of changing the origin simplifies our notation considerably. In the 
examples above we could have changed the origin alone, or both 
the origin and scale, and could have altered the questions to read : 


Ex. I. Origin at age 25 

Unit of differencing i year of age 
Ex. 2. Origin at ^ = 46 

Unit of differencing x= 1 
Ex. 3. Origin at a; = 2-0 

Unit of differencing x = i 


Given “j,-' "4 

Required u^i. 

Given Ug, u^, w.i, M4 
Required Mj. 

Given u^, i/.,, 1/3, i/g, 
Required t/j and u^. 


9 . If in Newton’s formula 

= “x + « ti) -^^“x + ” (3) ■^■’’«x + • • • 

we put h=i, x = o, and replace n by x, we obtain the series 

«X = “0 + ■ 

This is generally called the advancing difference formula, and gives 
i/j. in terms of Wq and its leading differences, where the interval of 
tabulation is treated as the unit abscissa. 

If, however, we wish to obtain in terms of and its leading 
differences, we may write the formula 

Wx = «-m+(m+x) = ^’"+“«-m = (* +A)"*+*M_„ 

— m + (m + A:)(2) A»i/_„ + ... 

+ (wi + x)(,) A’’i/_„ + .... 
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It is often more convenient to use this formula than to obtain 
in terms of Mq and differences of i/q, the advantage being that thereby 
we can make use of values of the argument on either side of x. 

In some cases, particularly when dealing with the summation of 
certain series, it is expedient to represent the first term by Wj rather 
than by Mq. If in Newton’s formula w'e put h^i, x='i w^e have 

or i/,_^^ = «i + X(j)AMj + ar(2)A2u, + ..., 

and these fommlae can be used for the summation of the series 
Mj + //„ + Mg + . . . , (See Chap. VI, paragraph 2.) 

10. Subdivision of intervais. 

A frequent problem in actuarial work is the interpolation for 
values of u^. at intervening points given eveiy fifth or tenth value of 
the function. For example, the problem may be to complete the 
series Uq, u^, Uj, « 3, ... from the known values Ug, «io, W15, ... or 
from Ufl, Mjq, tigo’ • 

A simple method for obtaining the intervening values where 
quinquennial values are known is given below. 

Let 8ttj. denote the difference for unit intervals of x and Au^ 
denote the difference for quinquennial intert'als. 

Then may be expressed as either (1+8)^ u^ or as ( i + A) u^. 

Symbolically (i+5)^:=i+A, 

i.e. (i +8) s(i + A)^, 

or S = (i+A)f-i. 

From this relation w'e can find easily that 

=(-2 A--o 8A*+-048A*- ..,) 

Hence 5*«j. = {-2A- ■o8A*+ oqSA*- ...)* 

= (-o4A2--o 32A3 
Similarly 8“i/j = (-oo8A®— ...) 

The same principle can be adopted if decennial values are 
known. In that event At/^, A”Mj., ... will represent differences for 
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decennial intervals, and the individual differences will be found 
from the identity S = (i + A)* — i. 

An example will show the application of the method. 

Example 4. 

From the following table of yearly premiums for policies maturing at 
quinquennial ages, estimate the premium for policies maturing at all 
ages from 45 to 50 inclusive : 

Age * • 4S 50 55 60 65 

Premium 2-871 2-404 2-083 r-862 1-712 

The leading differences for quinquennial intervals are 

Au, A*u^ A‘u^ A*7 i^ 

— -467 +-146 —-046 +-017 

The formulae required are 

8tt,5 = (-2A--o8A^ + -048A®--o336A*) Ux= --1078592, 

= (-04A* - -032A® + -0256A*) Mj, = -f -0077472, 

= (-008A* - -0096A*) = - -00053 1 2, 

= -ooi6A*«* = + -0000272, 

assuming fourth differences constant. 

We have therefore by completing the table of differences, 


Age 




8*u, 

8‘u, 

45 

2-871 

-10786 




46 

2-763 

- -lOOI I 

+ -007747 

- -0005312 


47 

2-663 

— -09290 

+007216 

— -0005040 

+ -0000272 

48 

2-570 

— 08618 

+ -006712 

- -0004768 

+ -0000272 

49 

2-484 

- -07995 

+ -006235 



SO 

2-404 






Note. Since we require the value of the premium to the nearest 
penny, three decimal places will be required in the u, column. In this 
example, for results correct to three figures, four decimal places will be 
need^ : 8w, must therefore be given to five decimal places, to six 
and 8*u, to seven, since errors in higher differences accumulate rapidly. 
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EXAMPLES 2 

Given the following data (a), find the missing term or terms {b) : 


I. 

(a) Mo = 580, «! = 

= 556. “1=520, 

“4 = 385: W “3- 

2. 

(a) Ml = 386, M3 = 

= 530, “5 = 810; 

(6) “2; “4. 

3 - 

(a) Mo =150, Ui = 

= 192, “2=241, 

“4=374: W “ 3 - 

4 - 

(a) Ml = 94, 1*3 = 

265, “5=415; 

( 6 ) “2; “4. 

5 - 

(a) Mo = 6021, M, 

= 5229, “2 = 4559, “8 = 3979; (b) uj. 

6. 

(a) Mbo = 92345, 

“41 = 9*556. “52 = 90748, “55 = 88204; (b) M53: “54. 

7 - 

(a) M_i = 202, Mo 

= 175, “1 = 82, 

“2=55; (b) u^. 

8. 

(a) Mo = o, M, = 3, 

,“2=10,113 = 34 

, «5 = 209, Mg = 1 002 ; (i) “4 ; Mg ; “7. 

9 - 

( a ) Mo =192-1, 

“1=187-5, “a 

,= 1847, “3=184-6, “4=194-6, 

“ 6 =i 99 ‘ 4 . W 7 = 2 I 2 - 7 . 

“, = 224-3; (6) 

“e; “8- 

10. 

(a) 110 = 98203, 

“1 = 97843. “s = 

= 97459 . “3 = 97034: (6) “i 85 - 

II4 

, The numbers of members of a 

L certain Society are as given in the 

following table : 




Year 

Number 



1910 

845 



1911 

867 



1912 


Make the best estimate 


1913 

846 

you can of the 


1914 

821 

numbers in 1912 and 


*9*5 

772 

1916 


1916 




1917 

757 



1918 

761 



1919 

796 



12. Find p„ if />4 o = -98428, Ps, = -98335. p64= -98oo8, p« = -97877- 

13. If «o, ttj, f<2, ••• tt# be consecutive terms of a series, prove that, 
if fifth differences are constant, 

«a= -oS^o- •3«i + •75«*t+ + 

Supply the missing term : 

*<0 = 72795 “4 = 67919 

“1 = 7*651 «5 = 66566 

“1 = 70458 i «, = 6 sis 2. 
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14. Mi!3s=: 2-37107 "237 = 2-37474 

«236 = 2-37291 “23S = 2-376S8- 

Find 

15. Given Uq= —-5, Mj = —-484, 1/5 = 0, Mg = -256, find the missing 
terms. 


16. Given the following data: Wo = o, Wio= 15, Wjo^So; estimate j/u- 
If you were given in addition 1/5 = 35, would your estimate be 
revised? Illustrate your answer by a diagram. 


17- 

table : 


Find the value of an annuity at 5I per cent, given the following 
Rate per cent. Annuity-value 


4 17-29203 

4^ 16-28889 

5 15-37245 

si 14-53375 

6 13-76483 


18. Obtain approximations to the missing values: 

50 51 52 53 54 55 56 

/(.x) 3-684 . 3-75O 3-7cSo 3-803 3-S26 

19. The area A of a circle diameter// is given for the following values : 

d 80 85 90 95 100 

A 5026 5674 6362 7088 7854 

Find approximate values for the areas of circles of diameters 82 and 
91 respectively. 

20. Calculate the value of sin 33° 13' 30” from the following table of 
sines : 

angle 3:° 30 31 32 33 34 

sin*” -5000 -5150 -5299 -5446 -5592 


21. 1/75 = 2459; H8n = 20i8; 1/55=1180; i/gg = 402. Calculate the 
values of Mgo and 1/79. 

22. From the data in Qu. 21 complete the table for values of Uj. 
corresponding to individual values of x from 75 to 85. 

23. Four values of a function at decennial points are given. Express 
hhi„ S^i/j (the differences for unit intervals) in terms of the differ- 
ences of the function for decennial intervals. 

p-ind the values «i to «, inclusive, given «o = o> “io=-^74> “ 2 o= -347> 
«jo=- 5 i 8. 



EXAMPLES 37 

24. tto = 23-i234; Mb = 23-7234; Mijj = 24-6834; tti8 = 26-i33o. Com- 
plete the series to Mg. 

25. If you were asked at very short notice to obtain approximate 
values for the complete series / (o), /(i), / (2), ... / (20), being given 
that /(o) = -oi3, /(io) = -248, /(is) = -578, and /{2o) = -983, what 
methods would you adopt, and what value would you obtain for/(9).^ 

26. Mg + Mg = I -9243 M2 + Mg =1-9823 

Mi-|-M,= 1-9590 M3 + Mg = 1-9956. 

Find Mg. 

27. Tables are available giving premiums at age 40 at the following 
rates per cent. : 

Rate per cent. 3 3^ 4 4I 5 6 

/go •025891 -024654 -023517 -022470 -021509 -019811 

It is desired to obtain Pin at 5! per cent. Obtain this, using 

(a) two of the above values; (^ 3 ) four of the above values; (y) six of 
the above values. 

28. Given 

10 in in 

" / (^) = 50042 6, / (x) = 329240, / (at) = 1 752 1 2 and / ( I o) = 403 65, 

1 4 T 

find / (i). 

29. Mi=l; M2-fM3 = 5-4i; Mg 18-47; 

Mj 4 - Mg + Mj -)- Mgo Mji -p Mj2 = 90-36. 

Find the value of Mj for all values of x from i to 12 inclusive. 

x-=io 

30. If you were given Ug, Mj, Mj and E how would you complete 

X* i 

the table of up to M,g? 

. 31. Given «q=ii7-7; U2=iio-y, Mg=: 102-7; 4*30 = 75-4, obtain the 

values of m* for all integral values of x from 0 to 10. 

32. Obtain the following relation between nine terms of the senes 
represented by Mj, u^, Ug' 

“6 = + "e) - f ('<3 + «7) + sV ("2 + "s) - fV («i + *6»). 

and find Mg, given 

“1 = 74556; «a = - 55938 ; M3 = -42796; Mg=-32788; Mg = -i 8432 ; 

M7 = -I3i 65; Mg = -o8828; m* = o. 
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33. It is asserted that a quantity, which varies from day to day, is a 
rational and integral function of the day of the month, of less than the 
fifth degree, and that its values on the first seven days of the month are 

30, 30, 28, 25, 22, 20, 20. 

Examine whether these assertions are consistent. If so, assume them to 
be true, and find (i) the degree of the function, (2) its value on the 
sixteenth of the month. 

34. Extrapolation may be defined as the process of obtaining further 
terms of a series as opposed to interpolation, which is the process of 
finding intermediate terms. 

The values of a certain function, corresponding to the values 4, 6, 8, 
10 of the argument are 914, 742, 605, 500 respectively. Extrapolate to 
calculate the value of the function corresponding to the value ii of the 
argument. 

35. Given «o=i876, “1 = 777, “3=19, “,= —218, interpolate the 
values of Uj, u^ and Wj, and find the form of the function, assuming it 
to be a rational integral function. 

36. Show that Newton’s formula 

= “o ■+ *,i, A«o + *(*, AX + *(4) + • • • 

can be put into the form 

“1 = «o + AX + xabA^u^ — xabc^i*u„ + . . ., 

where a=i— J(«+i), A=i— |(af+i), c= i — J (x+ i) etc. 

Hence show that the successive coefficients converge slowly and 
tend eventually to numerical equality. 
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INTERPOLATION WITH UNEQUAL 
INTERVALS 

1 . In the previous chapter formulae have been developed on the 
assumption that the argument proceeded by equal intervals. 
Although in actuarial problems the data are generally given at 
equidistant intervals of the independent ^ariable, it sometimes 
happens that we are required to interpolate when values of the 
function are known for unequal intervals. In other words, instead 
of values of for arguments x + h, x + 2h, JC + 3A, ... being given, 
the known values correspond to the arguments a, b, c, where 
a — b, b — c, ... are not necessarily equal. 

2. Divided differences. 

Since we cannot use the differences as hitherto defined, we 
adopt a special process involving the argument as well as the entry. 
The differences obtained by this process are called “divided” 
differences, and are found in the following manner. 

Let / [x) = u^ be given for the values x = a, x = b, x = c, x = d ..., 
where the intervals need not be equal. 

Then we have 

(i) First divided differences: 

“6-«a . 

b-a ' c-b ’ d-c ’ 
which may be written as 

/(*.<»)■. /(‘^. *); f{d,cy, .... 

(ii) Second divided differences: 

fje, b)-f{b, a) _ /{d, c)-fic, fe) . 
c-a ’ d-b 

f{c, b, a); f(d, c, b); .... 


or 
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(iii) Third divided differences: 


f{d, c, b)-f{c, b, a) 
d-a 


or f{d, c, b, fl); .... 


3. Notation for divided differences. 

There seems to be at present no universally recognized notation 
for divided differences. There are objections to the many forms 
in practice. For example, A'w„, representing (uh — Ug)l(b — a) 
(Freeman’s Actuarial Mathematics, p. 57, and Henry’s Calculus and 
Probability, Chap. VIII), has the disadvantage that the dash is apt to 
be confused with the index (cf. and A’^*u„). A preferable form 
A {a, b), A^ [a, b, c) ... is clearer, but departs from the recognized 
symbolical notation in that the function on which the divided 
difference symbol operates is not present ; this renders the notation 
unsuitable for elementary work. The method adopted in paragraph 2 
above is simple, and where there is no ambiguity with f{x,y), 
representing a function of two variables, there is no objection to its 
use. This notation does not, however, conform with the A notation 
for differencing when the intervals are equal and to that extent it 
is hardly satisfactory'. 

The following are some alternative notations that have been used : 
Arguments Function Divided differences 

(1) a,b,c... u„.u^,u,... A{a,b), A{b,c), \-{a,h,r) ... 

( 2 ) a,b,c... Ua,u^,u,... {a,b),{b,c),{a,b,c)... 

( 3 ) Xo,Xi,x^... f(xu),f{x{),f{x^)... f(Xo,X,),f{Xi,X,),f{x^„Xi,X.^)... 

(4) ^0. ^1. •’^2 • • ■ /(*o). /(*i)./(-*'2) • • • ^1]. [^1. K. *1. '■^’2] • • ■ 

(5) ^o> ‘^11 a^... A(i, Aj, ... ... 

(r) D. C. Fraser, (2) W. F. Sheppard, (3) J. F. Steffensen, 

(4) Milne-Thomson, (5) De Morgan. 

The notation that we shall adopt is due to Dr A. C. Aitken 
(Proc. Roy. Soc. Edin. vol. lviii, pp. 169 and 175) and has all the 
advantages possessed by the ordinary difference symbol. 
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The convention and definition are 

Aua = {u^-ui,)l(a-b) 
ft 

_ Ug 

a — b b — a' 

A*Ma= A (Am„) 

be c b 

= I_ f >‘a Vl + 1 + “ft 1 

a — c\a — b h —a\ c — a \c — b h —ci 

^ W,x , __ "h , 

{a-b)(a-c) (h-c){b — a) {c — a}(c — b)' 

Similarly it may be shown that 

*3 _ “a I “ft 

- la-b) {a-c){^) '(b^c) (■* - d) (^-T) 

^ {c — ~d) {c-a) {c — b)^{d—a) (d — b) (d-c) ' 

The symmetry of these differences is apparent and suggests 
that the property holds for a divided difference of any order. 
This is in fact true and the general proposition may be proved in 
many different ways. 

It may be proved by induction, by actually arranging the co- 
efficients or, incidentally, in establishing another important pro- 
position, namely that if it,, is of the form (x) then ^Uj. is 
constant (see paragraph 6). 

An elegant and succinct proof, based on the permutability of 
the arguments, is due to Dr Aitken. and is given below. 

*4. By definition 

Ah^=(Mj.-«„)/(-^-«) 

a 

= Au„. 

X 

The suffix of the operator and the argument of the operand are 
therefore interchangeable. Also, if Uj. is of the form P„ (.v), by the 
remainder theorem x — a is a factor of u^ — u^ and it follows that 
A«a is of the form P„_j (x). 
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For divided differences of higher orders the definitions are 
A (AMa:)= 

b a ab 

A (A*«J = AX. 

c ba abc 

and so on. 

A divided difference of any order, e.g. /^u^, is unaltered by any 

abc 

permutation of the arguments. For, since the suffix of any A 
operator and the argument of the operand are interchangeable, we 
have such interchanges as 

A (A«x)=A (AMa)=A (A*/;,) 

b a b X ax 

= A (AMj.)= A (AwJ 

a b X b 

= A (AKd)- 

X a 

All permutations can be obtained by successive single inter- 
changes. Thus, in the illustration above, 

box -> bxa -*■ axb -*■ abx -*■ xba xab. 

These interchanges are obviously possible for divided differences 
of any order and it follows therefore that a divided difference is a 
synunetrical function of all the arguments involved. 


Note. The proof given above depends entirely on the symmetrical 
property of the differences. It does not follow that if = then 
A«*= Ae'v In passing from, say, A (AkJ to A ( Aki>) the student 

a a a b a X 

should satisfy himself that the equivalence holds by writing down 


A (Amx)=— ' 

a b X Q 


X — A 


a — b I 




{x — a)(x~b) (b—x){b — a) {a — b){a-x)' 


A (Aub)= 

a X 


I 

h — a 



Ua — «x 

1 b — x 

a — x 


+ 


(6 — x) (6-a) {x — a){x—h) {a — b)(a — xy 
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6. The method of forming a divided difference table is best 
illustrated by an actual example. 


X 

1 


2 


4 


7 


12 


Example 1. 

Take out the divided differences of Wj. given the following table: 


X 124 7 12 

22 30 82 106 206 


The table is 



Au* 


22 

30-22^8 

2 — 1 


30 


-6-8.6 

4-1 


82-30_,6 

-3-6-6_ 


4-2 

7 -J 

82 


8 — 26 , -51— ( — 1-6) 

= - 36 ^ ^=192 

7-2 12-1 


106 — 82_ 0 
7-4 

i-5-(-3-6)_ 

12 — 2 ^ 

io6 


12-4 


206 — 106 


2o6 


Note. When the arguments and their order are fixed, we may use 
the shortened notation ... for the leading divided 

differences of u^, as in the example above. 


It is essential to arrange the work systematically if error is to be 
avoided. The numerators and denominators must be set out, either 
in parallel columns or in the form of fractions. Where there is 
ample space the columnar arrangement is better, especially where 
the divisors are cumbrous. It should be noted that while the 
numerators are the first ordinary differences of the preceding 
divided differences, the denominators are all formed directly from 
the arguments, differencing first in the ordinary way, then in pairs, 
then in triplets, and so on. It will also be seen that the divisor 
is always the difference between the values of x for the last and 
first involved in the difference. 
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6. Newton’s divided difference formula. 

Let the given values of Uj. be 

Hfc, ... Ufc, u,. 

Then, by definition, u^ = Ua + {x-a) 

X 

A «o = “a + (^ - *) A®Ma. 

X b bx 

A®Ma. 

bx be bex 


A\' Ma = A" «a + (^ - 0 A’' + “ Ma . 

bc...kx bc...kl be. ..lx 

Hence, by successive substitution of each identity in the one 
preceding it, we have 

“x = “a + (*-«) + (^-*) AX+--- 

b be 

+ {x — a){x-b)...{x~k) + ■■■ A"''‘^Wa- 

bc...t be. ..lx 

Thus u^=fJx + K+i(x), 

where represents the sum of all the terms except the last and 
i?„+i (x) the last, 

(x) may be written as 

(x - a) (x “ f>) ... (x — 1) t 

abc . . I 

by permuting the arguments of the divided difference. 

It should be noted that the term preceding R is 
(x - a) (x -/»)... (x - k) L’‘«a. 

be. ..I 

involving n factors in the coefficient. In R there is an additional 
factor x — l. 

If we put X equal to a, 6, c ... in succession, the R term vanishes and 
we have the following results which are identities, analogous to those 
obtained for equal intervals in Chapter I : 

[ = "6 = «<. + (/^ - a) A "a . 

b 

[LJx_c = u, = «„ + (£ -a; A«« + (c-a) {c-b) A“Mo, 

b be 

[t^*]x-d = «d = «a + (^-«) AMa + (<^-«) {<i-b) AX 

+ {d—a) (d—b) {d—c) AX- 

bed 

It is seen that R is the only term involving ; it vanishes if x takes 
any of the values a, b, c ... I, or for any value of x if A"+X = 0' 
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It follows therefore that is that polynomial of the wth degree 
in X which takes the (n + i) given values u„, «{,, ... Uj, when 

x = a, b, c ... I respectively. If therefore is itself a polynomial 
of the nth degree in x it must be the polynomial i.e. R vanishes 
for all values of .v. We have therefore th.at 


V' ' = o and 


(ibr ...I 


AJ'Uj is constant. 

be. ..I 


Uj. is thus the unique polynomial of degree n which represents 
«(,, .... Therefore the coefficient of x" does not depend on the 
order in which a, b,c, ... are taken. Since this coefficient is 
it follows that the divided difference is a symmetrical function of 
all the arguments involved. (See paragraph 3 above.) 

If Hj. is not a polynomial of the wth degree in x, we have only the 
relation /r , „ / 


where the R term itself involves u^, and if the expression 
+ (V) be worked out by expanding the coefficients all that 

we obtain is the identity u^ = Uj.. Thus our results do not help us 
to find Wy unless we have some knowledge of or may make some 
assumption with regard to the value of (.v). It may be said 
in fact that (.v) plays the same part as does Fj. in Chap. I, 
paragraph 17. As in the case of ordinary’ differences it is assumed 
that Rn^i (^) is negligible and may be put equal to zero, giving 
finally 

.r 

= i/„ + (A -«) ^.M„ + (x-a) (x-b) 

b be 

+ (.v-a) (x-h) ... {x~k) A"Wa, 

be ...I 

which is New’ton’s formula for interpolation with divided dif- 
ferences. 

For the method of obtaining the limits between which R„^.i (x) 
lies, see paragraphs 16 and 17 below. 


7. Sheppard’s rules. 

If, as in the paragraph above, the arguments are x, a, b, c ... k, I, 
where is to be found and the w + i values r/,,, wj,, u^... u^, Ui are 
given, we may put Newton’s formula into a more compact form 
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by the use of a notation due to Mr D. C. Fraser. This notation is as 
follows: 

x—a=A, 
x — bsB, 
x—c = C, 


The arguments a, b, c ... involved in the divided differences and 
the factors A, B, C ... are in the same order, with the exception 
that the small letters are always one in advance of the capitals, 
thus: 

Arguments, a ab abc abed abede .... 

Capitals, 1 A AB ABC ABCD .... 

Newton’s formula thus reads 

Ux = Ua+A^Ua + ABA^„ + ABCdiu^ + ...-\-ABCD ... K 

b be bed be ...I 

Dr W. F. Sheppard has given very convenient rules, embodying 
the same principle, and by means of these rules formulae may be 
written down at sight for any intervals. As expanded by Mr G. J. 
Lidstone [y.I.A. vol. Lviii, p. 65 and references there given] these 
rules are as follows : 

“(i) We start with any tabulated value of u. 

(ii) We pass to the successive differences by steps, each of 
which may be either upward or downward”; 

[each step involving a new « whose subscript will be numerically 
the next lower/higher if the step is up/down, and the w’s are 
arranged in the numerical sequence of the variables.] 

“(iii) The new suffix [of «] which is introduced at each step 
determines the new factor (involving x) for use in the next term.” 

[That is, each divided difference of the nth order has for its 
coefficient the product of n factors of the form (x—a^.) where 
represents a value of the variable, and has to be given all the n 
values that were involved in the last preceding difference. 

These rules apply whether the intervals are equal or unequal ; if 
they are equal the divided differences are of the form A^u/n!.] 
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These rules are familiarly known as the “zig-zag rules’’ for 
reasons which will now be explained. If we are given a set of u’s 
and their differences as in the following scheme 

«o 


«6 

be 



d cd6 


Ud 

de 

e 

and use them to interpolate for u^, it is not necessary to use the 
formula involving and its leading differences. We may begin 
with any of the u’s and at each step move either upwards or down- 
wards in proceeding to the next column in the difference table, so 
that we may if we like follow a “zig-zag” route; in fact with n 
values we have a choice of 2”“^ different routes. [Cf. Sheppard, 
y.I.A. vol. L, p. 89.] 

Suppose we wish to begin with u^. The next term may be either 
or AMci since both of these involve u^: let us take in- 

C d c 

volving and u^. We may next take either or since 

be cd 

both of these also involve and Wj, : let us take We may then 

cd 

move to or and we select the latter. We then have no 

bed ede 

further choice and we move to Our formula thus brings in 

bede 

the small letters in the order 

c, b, d, e, a 

and therefore the factors of the coefficients will enter in the order 

1, C, B, D, B. 
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Thus we may write down at once the required formula 
Uj. = u^ + C£\Ui) + BC + fiC/) + BCDE A* 

c cd cde bcd6 

or 

= f/j. + (,v - r) A «(, + (^ - i) {x - c) A^M(, + (JC - {x -c){x-d) A®«b 

c cd cde 

+ (a: - b) (.V - f) (a; - d) (x - e) /S^u^. 

bcde 

Sheppard’s rules are of the utmost importance, and a clear 
understanding of them will save the student much trouble. 

8. The following is an alternative proof of the property of divided 
differences which was established in paragraph 6. 

If j/j. be a polynomial of the «th degree in x, then the nth divided 
difference of Uj. is constant. 

It will be sufficient to consider the function y = x”. Then, if the 
values of the argument x be a, b, c the first divided difference 
A< 2 " is 

b 

(6" — aP)!{b — a) = 6"“* + + b”~^a^ -f . . . + 

This is a symmetrical function of the (n — i )th degree in a and h, 
and is the coefficient of x”-i in the expansion of 

(i +bx + b'^x^-\- ...) (1 +cx + a^x 2 + ...), 

i.e. in the expansion of . , 

^ I — ox I — ax 

Thus A A" is the coefficient of x"~^ in — ^ and therefore 
c \—cx I— bx 

A*<i" is the coefficient of in 

^ 

c — a\i—cx i—bx i—bx 1— ax/’ 

i.e. in x 1^ — ^ . — - — I, or the coefficient of x"“® in 
' i - ex I - ox I - ox/ 

III 
I — ax’ I—bx 1 — ex' 

Proceeding in this way, we find that Ala” is the coefficient of 

be... 
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III 

xn-r ijj _ where there are tc factors in the 

i—ax i—bx i—cx 

product. This coefficient is evidently symmetrical in a, b, c .... 
Finally, is the coefficient of ic® in a similar expression, and 

be... 

this coefficient is unity. 

Thus, the «th divided difference of a polynomial of the «th degree 
in X is constant and equal to the coefficient of the highest power 
of and higher divided differences vanish. Also, it has been 
proved incidentally that a divided difference is a symmetrical 
function of the variables involved; for the expression obtained for 
is symmetrical in a, b, c — 

bed... 

9. Newton’s divided difference formula is quite easy to apply in 
practice, as the following example will show. 

Example 2 . 

From the data in Ex. i, find Ug. 

Assuming fourth divided differences constant, the formula gives 

«8 = Mi + (8- i) .l«i + (8- i) (8-2) 

+ ( 8 -i)( 8 - 2 ) (S-4) A’«i + («-i)(8-2) ( 8 - 4 ) (8-7) AS 

= «i + 7 A«i + 42 AS + 168AS+ 168AS 

= 22 + 56 + 252 - 2688 + 32-3 
= 93 to the nearest integer. 


10. Relation between divided difSTerences and ordinary differences. 

If the arguments a, b, c ... k, I are spaced at equal intervals A, 

we shall have , , , , , , 

n = o~a = c — b = a—c=...=l~k. 


Hence in forming Aw the divisors will all be equal to h. 

Thus AWa = Am<,/A etc. 

6 

Similarly in forming A®« the divisors c — a, d-b ... will all be 
equal to zh. 


AS = AS/2!A*- 


Oe 

In forming A®m the divisors d — a, e~b ... will all be equal to 3A. 


F H A S 


4 
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A®Ma = ^®«a/3 1 

bed 

In general A"«a = ^”ujn ! A". 

bed... 

If therefore the common interval h 5 unity, the divided diflFerencea 
will diminish much more rapidly than the ordinary differences. 
In interpolation formulae this is counterbalanced by the denomi- 
nators of the form n! which appear in the coefficients of the 
ordinary differences but not in the coefficients of the divided 
differences. 

The formula for Mj. in terms of and its leading divided 
differences is 


= + A«a + (A;-a) {x-b) AX+--- 

b be 

+ {x-a) (x~b) ... (x-k) A"Wn- 

be. ..I 

This becomes, on putting x — a = nh, 

“a+nA = + "A [nA - (i - O.)] -f- . . .. 

b be 

If now b — a = ht c — a = 2h ... 


nhAu„ nh (nh — h) A^u,, 

«a+n)k - «a + “ ^ 

nh (nh - h) (nh - zh) 

3!A» 

from the relations proved above; 

ie. = «„ + n(i, Aw„ + n® + n,,, + . . . , 

which is Newton’s formula for advancing differences. 

It is easily seen therefore that in order to pass from the divided 
difference formula 


«x = «a + (*-«) AWa + (*-a) (x-b) A®«a+"- 

b be 


to the advancing difference formula (when the intervals are the 
same) we may replace A by A, drop the subscripts and insert 
factorial denominators thus: 


. / _\ A . . 


I 0 AS.. 
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11. Lagrange’s interpolation formula. 

On the same assumption as has been made hitherto, namely that 
the function concerned is a polynomial in x, an interpolation 
formula can be evolved which is equivalent to the process of 
splitting up an algebraic fraction into its partial fractions. 

Let n values of the function y = u^ be given, so that is sup- 
posed to be a polynomial of the (w — i)th degree in x, and let the 
given values of x be a, b, c, k. 

Then we may write 

u^ = A {x — b) (x — c) ... (jc — ^) -t- B (jc — a) (.r - c) ... (x — A) - 1 - . . . 

+ K{x-a) (x-b) ... (x-j), 

where there are n terms each of degree n—i in x. 

This is true for all values of x involved. Put therefore x = a. 

Then u^ = A (a-b) (a-c) ... (a-k), 

• J Ifa 

(a — b){a-c)...{a — k)’ 

Similarly, by putting x==b, 

% 

(b-a) (b~c) ... {b~ky 

In like manner all the coefficients can be found. 

• u =u ( ^-^) (^- ^) (^-^) I ■■ (^- «) (-^-0 jx -k) 

■■ * '^la-b)[a-c) ...{a-ky '>{b~a){b-c) ...(b-k)^- 

(.v-g) jx-b) (x-c) ... 
‘‘{k-a)(k-b)ir-c)...- 

or otherwise 

Wa 

(x — a) (x — b) ... (x — k) (a — b)(a — c) ... {a — k)'x-a 

^ (b — a) (b — c) ... (b - k)' x — b^ 

It is evident that this is exactly the same as splitting the fraction 

*ix 

(x-a)(x-fc) ... (x-A) 

into partial fractions. 

4-2 
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This alternative expression is due to Euler and was given earlier 
than Lagrange’s formula. 

It is interesting to note that Euler’s form, when written as 

, «a _ . J'b , 

{x — a){x — b)... {a—x){a~b)... {b — x){b — a)... 

is an expression for the divided difference A” It follows therefore 

cibi‘ 

that Euler’s formula (and consequently Lagrange’s) can be evolved 
from the divided difference formula by equating the nth divided 
difference to zero. Also, since the expansion is symmetrical in x, a, b, c 
... k, the divided difference is independent of the order in which the 
arguments are taken, as stated in paragraph 3 above. 

12. Lagrange’s formula is usually laborious to apply in practice ' 
and requires close attention to sign; it is generally simpler to 
employ other finite difference methods. Where the intervals are 
equal an advancing difference formula may be used, and for 
unequal intervals it is preferable to use divided differences. 

The principles on which this formula has been developed are 
the same as those assumed for the difference formulae, namely that 
n values of the function being given, nth differences are assumed 
zero. The following examples show the application of the formula: 
Example 3. 

Given the data in Ex. i, obtain Ug by tlje use of Lagrange’s formula. 

^ “1 I__ 

(8- i )(8-2)(8-4)(8-7)(8- 12) (i -2)(i -4)(i -7)(i - i2)’(8- i) 

, I 

(2 - 1) (2 - 4) (2 - 7) (2 - 1 2) ■ (8 - 2) 

W4 I 

■*■(4-1) (4- 2) (4-7) (4-i2)'(8-4)'^‘‘’’ 
ttg _ 22 30 

7.6.4. i .(-4)"(- i )(-3)(-6)(- ii ).7''' i .(-2)(- s )(- io).6 

82 106 

^3-2.(-3) (-8).4^6 .s. 3.(-5 ).i 
206 

■'■11.10.8.5. (-4)' 

iig= - 10-666 ... + 33-6-95-666 ... + 158-293 ... + 7-865 
= 93 (to the nearest integer) as in Ex. 2. 
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Example 4. 

Find the form of the function y = Uj. given that 

«„ = 8, 1/4 = 68, «6=I23. 

By Lagrange’s formula; 

_ 8 I 1 1 I 

*(*-!) (*-4) (*- 5 )‘'(-i)(- 4 )T-^'*^i (- 3 )(- 4 )’*-i 

68 I 123 1 

4 - 3 (-i )'*-4 S- 4 -i'*-S 

2 r II I 6S I 123 I 

= . . - ^ 

5 jc 12 x—i 12 * — 4 20 * — 5 

_ I ii 5 ac + 40 I 57*-24 
~2oa;(jc-s) i2(x-i)(*-4) 

23^;+ 8 19^;- 8 

~:^x{x-s) 4 (*-iHx- 4 )’ 

[(23X + 8) (x= - 5x + 4) - ( 1 9x - 8) (x2 - 5x)] 

= x^ - x2 + 3* + 8. 

It is useful to work out this example by divided differences, adopting 
two different orders for tlie values of x, thus illustrating the principle 
that, if the same u's are involved, the order is indifferent. 


(a) X 




A’u, 

0 


8 





3-^1= 3 



I 


II 

16 = 4 = 4 




57-^3 = 19 


5-5 = 1 

4 


68 

36^4 = 9 




t-n 

II 



5 


123 




= 8 + 3 x + 4x (x- i)+ ix(x — 

i){^- 4 )=*^- 

x® + 3X + 8. 

(6)x 



A‘u. 


5 

123 






-” 5 -^ - 5=23 



0 

8 


-8-r -1=8 




+ 60= +4 = 15 


-4+ -4 

4 

68 


+ 4 = +1 = 4 



I 


II 


- 57-~3 = i9 
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W, = 123 + Z 3 (*- 5 )->- 8 (*-s)*+i (*- 5 )*(*_ 4 ) 

= 123+23*— 115 + 8** — 40* + *®-9*> + 20* 

= **-** + 3 * + 8. 

Here, for example, 

= A “4 = i 9 , 

4 1 

I. 

1,4.5 0.4,1 

1 3 . The following examples are instructive : 

Example 6, 

“0 = - l 8 , Mj = O, (/g = O, = - 248 , ttg = O, W, := 13 104 . 

Find the form of u^., assuming it to be a polynomial in *. 

Now since Uj = o, 113 = 0, iie=0) the function must be of the form 
(* ~ t) (* — 3) {x- 6 )tj) (*), where <f> (x) is a polynomial in * of the second 
degree. 

=9{o) A ^ (o) = i, since Mo = “ i8 
= ^(5) < l >{ 5 ) = 3 ‘> .. «i5=- 248 

= <f>{g) </.(9) = 9i, „ n,= 13104 


Whence, from the divided difference table, 

X 

4 (*) 


0 

I 

6 

s 

31 

I 



15 

9 

9 » 



= <^ (0) + * (*) + *(*- 5) {*) 


= I +6* + * (* — 5) 

= ** + * + 1; 

/. Wi = (:r-i) (*-3) (*-6) (** + *+!) 
= **-9**+ 18** -** + 9*- 18, 


4.2 (-1) 


and 


8.6.3 
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Example e. 

Given % = 23, «,i = 899, M27= 17315, *<34 = 35606, t<42 = 685io, con- 
struct a table of divided differences and extend the table to include 
arguments * = 3 repeated as many times as may be necessary to find 
Uj. in powers of (x — 3). 
b'rom the data we have 


X 




5 

23 




146 



II 

899 

40 



1026 


I 

27 

173 1 5 

69 



2613 


I 

34 

35606 

100 



4U3 


X 

42 

68510 

a 



b 


I 

3 

c 

d 



e 


1 



f 

X 

Now 

a= 100 — 24x1 = 

76 



*= 4113-31 = 

1757 



c =68510 - 39X i = — 

13 



d= a -31x1 = 

45 



€ = b — 39X<f = 

2 



/= d -39^1 = 

6 



t<, = r + (A:-3)eH-(*-3)*/+(A,--3)*x i 

A 


= - 1 3 2 (x - 3) 6 (x - 

3)* + (*-3)’- 


14 . Newton’s 

formula with divided differences may be considered 

as the basic formula in finite differences. 

It has been shown that, 

by making the intervals equal, the ordinary advancing difference 
formula follows, and that Lagrange’s formula can be evolved from 
the divided difference formula by equating the nth divided dif- 
ference to zero. Moreover, by taking the limiting values when the 


intervals tend to zero, Taylor’s theorem can be obtained. 

The formula is of the utmost importance analytically and 
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historically, and the advanced student may be recommended to 
read Mr D. C. Fraser’s “Newton’s Interpolation Formulas” 
{J.I.A. vol. LI, pp. 77-106 and pp. 21 1-32, and vol. Lviii, pp. 53-95) 
and the same author’s “Newton and Interpolation” (an article 
in Newton, 1727-1927, a memorial volume published by the 
Mathematical Association). 

*15. Adjusted differences. 

There is a system of differences which may be considered as 
the connecting link between ordinary differences and divided 
differences. These differences, which are called adjusted differences, 
were used by Newton and re-discovered by Sheppard. 

When the successive arguments are a,b,c ... the relation between 
adjusted differences and ordinary differences is as follows : 


Order of 
differences 

Divisor of Uilference 
for divided 
differences 

Divisor of difference 
for adjusted 
differences 

First 

a — b 

a — b 

Second 

a — c 

i (a-c) 

Third 

a—d 

lia-d) 

Fourth 

a — e 

i(a-e) 


Ordinary differences are adjusted differences and both sets can 
be used in the same scheme. 

An interesting account of the Newton-Sheppard system of adjusted 
differences will be found in J.I.A. vol. Lvm, pp. 60-74 (D- Fraser). 

*16. An expreaaion for the nth divided difference. 

The results of paragraph 5 may be written in the form 

K+i {x) = u^-U^, 

where (*) vanishes for w + i values of x, say a,b,c ... 1 . Hence, 
by repeated application of Rolle’s Theorem (Part I, Chap. IV, 
p. 62), it follows that, in the interval which includes these values, 
the first differential coefficient of R with respect to x vanishes at 
least n times, the second differential coefficient vanishes at least 
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n — I times, and finally, the wth differential coefficient vanishes at 
least once, say where x = ^. 


■■■ 


Now is of the form P„(a:), and since the nth differential 
coefficient of P„ (jc) with respect to a; is n ! times the coefficient of 
a;", it follows that 


be ... I 


or 


I rd'' 1 


for some value of x (^) within the range which includes a,b,c ... I, 
and X. 


*17. Remainder term in the divided difference formula. 

If in the proof above we bring in another term Uj., we have 


^/"+i 

where — , , tic is written for 

d^n+i , 


■‘C* «« = [^1 + 0 !. 

f— 1 


This is the divided difference involved in R„+i (.v). 


W = (* -O^ri “?]/(” + 0!- 

When the intervals are all unity, this becomes 

^n+l 

f 1 (^) ^{n + n ^n+i ’ 

where f is some value in the interval including all the arguments 
involved, i.e. the given arguments a, b ... I and also x. 
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EXAMPLES 3 

1 . Given terms at unequal intervals, explain how to apply the method 
of divided differences to find an interpolated value: illustrate your 
answer by finding given 

«4-6o=i34S. «4-55=i47o, «4-?o = 20Io, tti.9o = 38iS. «6-i6= 10965. 

2- *<40 = 43833. *<42 = 46568, *<44 = 49431, *<46 = 50912. Use divided 
differences to find *<43. 

3. Given the following table, find log 656: 

No. 654 658 659 661 

Log 2-8156 2-8182 2-8189 2-S202 

4 - «5o=*'6990, «62 = I' 7 i 6 o, *<64=1-7324, **66=1-7404. Find *<53 by 
divided differences. 


5. **35-0= 1*75. *<38-6= *280, *<34.6 = 2180, *<46.6 = 2420. Obtain a 
value for u^g (i) by advancing differences, (ii) by divided differences. 

6. *<,.0 = 235, u,. 1=256, *<7.4 = 436, **3.1 = 484. Find Uj.o. 

7. Find the first three divided differences of the function y = for 
the arguments x = /, m, «, p. 

Find by Lagrange’s formula the value of 

8. 1*48 given *<40=15-22, **45 = 13-99, **50=12-62, 1*66=11-13- 

9. *<g given *<o =I 7 ‘ 378 , **6 = 15-894, *<10= 14-270, *<15= 12-412. 

10. *<23 given Ujo = 22-40, *<15 = 21-66, *<20 = 20-82, u25=i9-85. 

11. *<i given *<o = -400, = -128, = -224, *<4 = -376. 


12 . 

given 


Use Lagrange’s formula to find the form of the function y=f{x) 

X 0236 

/ (*) 659 705 729 804 


13. Values of w* are given for all integral values of x from o to « - i. 
Show that Ux is capable of expression in the form 

(*-«)! (n-i)l [*-«+ I ' '*”*-**-4 2 


+ (*• “ *)«) -•■• + («- t)(n-l) 



Find «4 given *<o= 4 . *<1 = 7. *<2 = 12, *<3 = 20, by using the above 
formula. 
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14- ®y *neans of Lagrange’s formula, prove that, approximately, 

(t) «i = «3--3 (“5-«_3) + -2 («_3-M_5), 

(2) «0 = i [“l ,+ M-l] - 4 [H“a - J («-l - 


1 5- Four equidistant values m_j, u-^, and being given, a value 
is interpolated by Lagrange’s formula. Show that it may be written 
in the form 


+xu■^ A2«_ + AX. 

3 1 3 ! 0 

where x + j = i. 


i6. If etc. be 

^i-«0 ‘h-Oi 

divided differences of the first order;/ X, «., °o) 

^2 - ’ 

etc. divided differences of the second order and so on, find /■ (2, 4 q tol 
where / (*) = (i) ar' - 2.r, (ii) + JcX I . 


17. Prove that if be a polynomial of the nth degree in x, and if 

values Ma, Mj, u of be given, then the expressions for Uj in terms 

of its divided differences are identically equal whatever the order of 
arrangement of the u’s. 


18. Apply Lagrange’s formula to find / (5) and /(6), given that 

/(i) = 2, /(2) = 4, /(3) = 8, /(4)=i6 and /(7) = i28; 

and explain why the results differ from those obtained by completing 
the series of powers of 2. 


*9- “-30-30; “-13-34; “3 = 38; “13 = 42. Find 

^9; “72 = 7'07 ; “73 = 6-78 ; i<76 = 6-i 8. Interpolate to find 
by divided differences, using the following orders of the argument : 

(') 70, 73. 75. 72: (ii) 72, 75, 70, 73. 

21. By means of divided differences, find the value of from the 
following table : 

* 17 21 23 31 

“* 14.646 83,526 194,486 279,846 923,526 

22. Prove that di*A:® = A;+jy+aand that 

»» 
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23. If the data are Ug, u,, u^, w,, u^, and the interpolation formula is 

Mj. = «* + Cl + Cj + Cg A®«0 + Ci, A*«o. 

S 4,7 8,1.7 3,4,7,11 

find the values of Cj, Cg, Cj and C4. 

24. If the data are as in Qu. 23 above, and the formula is 
“x=«3 + (*-3) A« + (*-3)(*-4) A®« + (*-3)(*-4)^A®« 

+ (*-3) (*-4) *(*-7) A*«, 

find the missing suffixes of the operators and the subscripts of the u’s. 
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CENTRAL DIFFERENCES 

1. If a series of values of be given, we can interpolate to find 

any intermediate value by one of the methods in the preceding 
chapters. Where the values of the argument x proceed by unit 
intervals it has been shown that, on certain assumptions, Newton’s 
advancing difference formula can be applied to give satisfactory 
results. If the value of were required for some value of x be- 
tween j!: = o and = i, it might be considered that we should obtain 
a better result if our knowledge of the shape of the curve extended 
on both sides of the values of x between which we wish to inter- 
polate. That is to say, where we may choose any values of Uj. at 
unit intervals for our data, it might be of advantage if we could use 
a formula involving values such as u_^, 7/_j, «o, Mj, Wg, u^, ... 

rather than u^, u^, u^, u^, u^, .... By the advancing difference 
formula we expand in terms of a given value of Wj. and its leading 
differences and, by giving x a suitable value, such a formula can 
be made to embrace any values of u that we wish. It is however 
convenient to use special formulae called central difference formulae, 
based on differences obtained from the values of on either side 
of the origin; this is found to result in smaller coefiicients and a 
more rapidly converging series of terms. 

2. There are various central difference formulae that are of use 
in actual practice, and the development of the better-known 
formulae is an exercise in the application of the fundamental 
principles of finite differences which will be advantageous to the 
student. These formulae apply whether the values of the function 
correspond to equidistant values of the argument or to values with 
unequal intervals. We shall consider first the general case and pass 
from it to the simpler and more usual case of equal intervals. 
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3. Gauss’s formulae. 

Suppose that we are given the values . . . u^, «6. “e • • ■ 

that we wish to deduce a formula for in terms of the even 

differences which fall on the line of and the odd differences 

which fall on the line between ii^ and u^, as shown by the lower of 
the two series of dotted lines in the following difference table: 

«/ 

to 

.Attl 

lab 

ab labo 

6 abc 

«s A*«« 

6d 

A«6 

e 

The formula will thus involve in succession 
"o. A«a. A^u,, A^u,, A*U„ .... 

6 ab abc lobe 

As we pass from each of these to the next, the new arguments 

brought in, one at a time, are A, /, c, ot Hence, applying 

rule (iii) of Chap. Ill, paragraph 7, namely, that each new argu- 
ment gives the new factor for use in the next term, and adopting 
the abbreviated notation x~a = A, x — b = B etc., we can at once 
write down the required formula. 

The formula will be 

= “a + -^ A«a + ABA^Ui +ABLA^Ui -f- ABCLAfu^ + ..., 

b ab abc lobe 

where the law of formation is evident. 

Now let the intervals between the arguments be all equal to 
unity and let a = o, so that 6 = 1, c~2, ..., /=— i, m=— 2, .... 
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Then, as in Chap. Ill, paragraph 10, we have 

AWfl =Ako; 

6 

=A*u_i/2!; 

ab 

=A»«_i/3!: 

abc 

A*Mm = A«M_2/4!. 

lobe 

Further, A=x-a = x, 

B = x—b = x~ I, 

L=^x-l =.v+ I, 

The formula for equal intervals is thus 

. i)., (a‘+ i) -v (jf— r) ,, 

“* = “ 0 + •*Atto + - — ' A*«_i + A*w_i 

+ ^ --i-i / A*u_r, + ... 

4- 

or 

“o + '»^(1)AMo + *(2)A*W .,! + (*+ l)(3)A®M_i-r(x+ i),4)A1m_2+.... 

This is known as Gauss’s “forward” formula for equal intervals. 

4. If we take the even differences as above but the odd differences 
which fall on the line between u, and (the upper of the two series 
of dotted lines in the table in the preceding paragraph) we obtain a 
formula which involves in succession 

“a. A*U„ .... 

a ab lobe 

The new arguments brought in are /, b, m, c, ... and, using the 
rule as before, we evolve the form 

“* = “a + ih n j + ALA?ui-t ABLA?u„ + ABLM&Stm + • • •« 

a ab lab lobe 
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From this we may pass at once to the formula for equal intervals of 
unity : 

«* = Wo + ^(1) ^“-1 + (* + 1 )(2) ^*“-1 + (* + I ) (3) A®M_2 

+ (v + 2)(4)A^M_2+ .... 

This is Gauss’s “backward” formula. 

Note. It is useful to remember that, after Kq, as we advance from 
each term to the next, in the forward form we alternately deduct unity 
from the subscript of u and add unity to the number whose factorial 
is the coefficient ; while in the backward form we alternately add unity 
to the number whose factorial is the coefficient and deduct unity from 
the subscript of u. 


5. Stirling’s formula. 

Taking the mean of the two Gauss formulae we arrive at the 
following expansion : 




X — I ‘^) 


I (A®«_i + A®«_. 2) + 




Ah/..2 + ..., 


in which we have alternately mean coefficients and mean differences, 
falling on the line through in the difference table. 

This is known as Stirling’s formula. In using it x should lie in 
the range — ^ to + ^, i.e. ^ on each side of the line of 


6. Bessel’s formula. 

Transferring the origin in the Gauss backward formula from 
o to I, we have 

^ 3:(a:-i)., x(x-i){x~2) ... 

“x = «i + (* - I ) ^“0 + - Vj— ^ «o + IT ^ 

I ( Ar+i)j:(j;-i)(x- 2 ) ^ 

4! ^ 

The mean of this and the forward formula is 

«x = i(“o + Wi) + (*-!) A«o 

a:(x:-i),..„ . (jc-i) a: (jc— i) .. 

+ — i + A^Mo) + - ^ A®m_j 

(ac + I ) a; (ac — 1 ) (jc — 2) , , . . . . , 

+ -^i(A‘«_i + A*u_2) + ..., 
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which is Bessel’s formula. In using the formula x should lie in 
the range o to i, i.e. J on each side of the central line. 

In this formula we have alternately mean «’s or differences, fall- 
ing on the central line between u„ and u^, and mean coefficients. 

If we write y + i for x in Bessel’s formula, we obtain the simpler 
and more symmetric form ; 

“x = i («o + «i) +y^^o + “j ^ J + ^^0) 


3! ' 4! 


i (A^«_2 + A*«_i) + . . . , 


the most convenient shape of the formula for practical purposes. 


7. An alternative method of obtaining Gauss’s forward formula 
for equal inteivals is as follows; 

The ordinary advancing difference expansion is 

Uj, = Mo + Xd) AMo •+• X(2) AX + *(3) AX + *(4) + • • • • 

Now AX=^*«-i + ^*W-i'. 

A®Uo = A®m_i + A*m_i ; A*m_j = A*m_3 + A®m_3 ; 


Therefore we may write 

= «0 + *(1) ( A%_i + A»m_i) 

+ x^3, (A®u_j + A«h_,) -I- ( A*u_j + A^<_i) + . . . 

= «o + X(j) Am© + X(a) A*m_i + (x,*, + X(,)) A®m_i 

+ (*(s) *(«)) AXi + . . . 

= “0 + *(i) + ^(*) ^*“-1 + (* + 1 )(s) A*m_i 

+ (*+ 0(4) (AXj + AXi)-!-..., 

since *(r) + *<r+l) = (*+ l)(r+l). 

and so on. 

A proof similar to the above, in which the general term is evolved, 
and which depends upon the method of separation of symbols, will be 
found in y.l. A . vol. L, pp. 31, 3a. 


F M A 8 
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The backward form may be obtained similarly by grouping the 
terms in a slightly different way and using the relations 

Auq = A«_i + , 

A'^ = A%_j + A®«_i , 

A“tt,j = A*tt_g + A*M_g , 

etc. . 

8. Everett’s formula. 

The Gauss forward formula with interval x and initial term Wj 
may be written 

^X+I = ©1 + *(1) + JCte, A% + (a; + I )(3 ) \^Vo + {x+i)u) .i 

+ (^+2)(5)A®t)_, + .... 

The backward formula with interval (x — i ) and initial term 
gives 

+ (^ - I )( 1 ) At'o + A^^o + + (^ + I )(4) A^T'_i 

+ (x+l)(B)AV,ij + .... 

Subtract the second series from the first : then, since 


we have 

At)^, = X(1 , At^i + (x 4- 1 ) (3 ) + (x + 2) ,5 ) + . . . 

-{x~ i)(i)At>o-X(3)A»t;_i-(x+ i),5)A®t)_5j-.... 
Put u,, Au^, A^m^, ... for iiv^, A*t;^, ^^v^, .... 

Then 


“* = X(1, Ml + (x + iXs) A^Mq + (x + 2)(5) A*«_I + . . . 

-(x- i)(i)Mo-X( 3 ,A='«_i-(x+ i)( 5,A«M_3- (i) 


When X is less than unity a convenient form of this formula for 
interpolation between and Mj is obtained by putting f=i— x; 
thus 


x(x®-i) X (x*- i) (x* — 4) 

Mjf = x«i +— — - A*m„+ — — A*u_i4- ... 


3! 


5I 




the most common form of Everett’s formula. 
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The above elegant proof is due to G. J. Lidstone {J.I.A. vol. lx, 
pp. 349-52). In his note on this formula Mr Lidstone shows how to 
obtain another formula similar to the above for interpolation between 
u-x and uj. 

If we difference formula (i) bearing in mind that 
A('* + r),„=(jc + r)„_i„ 

we have + (* + 1 )(j) AX + (*^ + 2)14) A‘«_i + . . . 

- Uo - JCfj, A^M_j - (* + I )(4, A*«_j - . . . , 
or, writing for Aa^., Aa* for AX and so on, 

Mj- = “o + (*+i)( 2) Aao+(* + 2),4, A 5 a_i + ... 

- *(2) Aa_i - (* + 1)<4, A^a.j - . . . . 

The formula then become.s 


^2 1 


(/>'-!) 


l! 


A*a_i+ ... 




2! 


4! 


where p = |+ A;and q=\—x. 

This form is generally known as Everett’s “second ’’ formula. Both 
formulae, perhaps more particularly the first, are specially adapted for 
use in statistical work. They can be applied in the case of unequal 
intervals: see Todhunter, vol. 1,, p. 137, and Lidstone, Proc. iEdin. 

Math. Soc. Series i, vol. XL, pp. 25-6. 


9 . Everett’s formulae can also be obtained from Gauss’s formulae 
by a simple transformation depending upon the properties of the 
binomial coefficients. 

Any pair of terms of the form 

3 '(r)AX + (j+0(rM)A'’+’«< 

= (V+ l)(r+l) (A'-W, + A’-+X)- {{y+ l)(r+l)->'(r>} A’’«, 

= ( J + J)(r+1) A’'«,+l - J(r-1 1) A'-«<, 

since (J+ I)(r+l)-J(r)=3'(r+l)• 

Taking the pairs 

1-2; 3-4; 5-b ... in Gauss’s forward formula, 
and 2-3 ; 4-5 ; 6-7 ... in Gauss’s backward formula, 
the two forms of Everett’s formula are at once obtained. 

\ I' 


\ 
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* 10 . It is of interest to show in a simple manner the relations be- 
tween Everett’s and Bessel’s formulae. The formulae are closely 
related, and the following demonstration indicates clearly the con- 
nection between corresponding coefficients in the two expansions. 
In Bessel’s formula let denote the coefficient of the “mean” 

difference , /»«, .o., ^ 

i (A2»u_„ + A2»u_„^i) 

and the coefficient of the difference A^"+^m_„. 

Then, since , , 

= A2"m_„ + A2”+i«_„, 

the sum of the terms 

K, ■ \ (A*™m_„ + A*"m_„+i) + A*"+iu_„ 

may be written as 

62„A2"u_„ + (|62„ + 62„^i) A2"+1m_„ (a) 

Consider the expression 

f2nA®"M_„ + C2„A2”«_„+j.. 

This is the same as 

(Cjt. + e2n) A*"l/_„ + €2„ A2»+lu_„. (6) 

Again, by reference to Bessel’s formula, it will be seen that 



Equating (a) and (i), and introducing the relation (c), we have 

^n + *2n = ^2n» 

and f2„ = \b^ ■+■ bg^^x 

= *2n{i + (^-i)/(2« + l)} 
x+n ^ 

whence C2„ = — i2n+i 

= *2»{i-(*-M2« + t)} 

x — n—i 
2n+i 

from which the coefficients already given in Everett’s formula can 
at once be deduced. 



ILLUSTRATIVE EXAMPLE 


69 


The same analysis applies to each pair of terms. 

It should be noted that 

(i) and c, are the same functions of x and i respectively; 

[In fact, e 2 ji = (■* + w)( 2 n+l)» 

*2n~ ^){2n+l) 

= (l-JC + «)(2„+j)] 

(ii) the pair of terms in Everett’s formula is exactly equivalent 
to the corresponding pair in Bessel’s formula. In Everett’s formula, 
therefore, precise allowance is made for the next odd difference, 
although this difference does not appear explicitly. 

Similarly, by taking the pair of terms 

^2„+l • i 

and 

in Stirling’s formula, Everett’s second formula may be obtained. 

11. It will be instructive to compare the results brought out by 
applying first, Gauss’s forward formula, and, secondly, the ordinary 
advancing difference formula, to the same set of data. 

Example 1. 

Interpolate by means of Gauss’s forward formula to find the present 
value of an annuity of i p.a. for 27 years at 5 per cent, compound 
interest, given the following table ; 


No. of 

years 

15 20 25 

30 

35 40 

Annuity-value 10-3797 12-4622 14-0939 15-3725 

16-3742 17-1591 

If we take 25 years as the 

origin and 

5 years as 

the unit, the value 

required will be u . 

4 - 




X 


Au, 

A*u, 


A‘u, A‘u, 

— 2 

10-3797 

2-0825 




— I 

12-4622 

1-6317 

- -4508 

•0977 


0 

14-0939 


-•3531 


--0215 



1-2786 


-0762 

•0054 

I 

15-3725 

1-0017 

— -2769 

•0601 

— -0161 

2 

16-3742 

— •2168 





-7849 




3 

17-1591 
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The Gauss forward formula is 

«« = «o + .r(i| A«o +ar(2) A*w_, + (jc + 1)(3, A^m.i + (a; + i )(^| A««_2 

+ (a: + 2),5)A=«_2. 

When AC = '4 the successive coefficients are 

•4; --12; --056; -0224; -010752 

and to four decimal places the value of u.^ is 14-6430, which agrees with 
the tabulated value. 

To apply the advancing difference formula we take 15 years as the 
origin and are required to find Uj. when cc = 2-4. 

In the formula 

= Mo + AC,1, AMo + Xii) AX + AC, 8, AX + *14) AX + *(B) AX 

the coefficients are 

2-4; 1-68; -224; --0336; --010752. 

On evaluating the expansion we obtain for Mj-i the value 14-6430 as 
above. 

It will be seen that the two results are in agreement (as indeed 
they must be, since the same values of u are used), and it may be 
asked therefore wherein lies the advantage of using the central 
difference formula. This question will be discussed in the next 
paragraph. 

12 . Consider an approximation to a particular value of based on, 
say, r values out of n available. The error in the approximation, 
as measured approximately by the first neglected term, is least 
when the coefficient of that term is least. It can be shown that this 
happens when the values of upon which the interpolation is 
based range round the space in which x falls, so that ac is as nearly 
as possible central. The central difference formulae give a systematic 
method for building up the table subject to these conditions. 

Again, the central difference coefficients are as a general rule 
smaller and more rapidly convergent than those required for the 
calculations in the advancing difference formula (as will be seen 
in the Example) and, by a suitable choice of origin, the arithmetical 
work may be reduced to a minimum. 

It should be noted that, in the phrase “as measured approximately 
by the first neglected term”, this measure is not theoretically complete ; 
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it is however generally sufficient in practice if the first neglected order 
of differences is constant or is changing but slowly. When this is not 
so it will not necessarily be true that a central difference formula 
beginning with Uq is more accurate than the advancing difference for- 
mula beginning with the same term. [See p. 337 of Sheppard’s paper, 
cited on p. 72.] 

We may even make the result worse by introducing differences of 
a higher order (cf. Chap. V, paragraph 15). In general, however, these 
anomalous cases arise only rarely. 


13 . It should be noted that the greater accuracy of central dif- 
ference formulae as compared wdth the advancing difference form 
is not due to the formulae but to placing * in the central interval of 
the range of the given values. Provided that this is done — which is 
very' important — it is immaterial whether w'e use a central dif- 
ference formula or the advancing difference formula with the same 
k’s. a disadvantage of the latter form is that the coefficients are 
larger and are not tabulated as, in practice, are those of the 
principal central difference formulae. 

14. Relative accuracy of the formulae. 

The relative accuracy of the various central difference formulae 
can be investigated in an elementary manner on the following lines. 

The Gauss forward formula is 

= Uo -I- ) Auo -b AV_i - 1 - (a: -I- 1 ) (3) j -b (r -I- 1 ) ,4 ) A V 2 -I- . . . . 

If we expand by Stirling’s formula as far as a certain order 
of even differences we shall obtain by a simple transformation the 
above formula to even differences, for the same k’s are involved in 
both. Similarly, Bessel’s formula and Gauss’s formula are identical 
to odd differences. Now the Gauss formula involves only ordinary 
differences while the other two series involve mean differences of 
the form ^ (A"k, - b A^m^^i). If instead of proceeding to constant 
differences the series stop short at, say, rth differences — which are 
not constant — the use of any of the formulae will involve an error. 
It remains to examine which of these formulae gives the best result 
in different circumstances. 
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The following demonstration is based on that given in greater 
detail by Mr D. C. Fraser vol. l, p, 25 : 

Suppose that x is not greater than •5. Then, by calculating the 
coefficients in Gauss’s formula, it will be found that for positive 
values of x none of the coefficients (except that multiplying Attg) 
differs greatly from + -5 times the preceding coefficient. (See 
Table, JJ.A. vol. L, p. 25.) Thus the terms after that involving 
the third difference are approximately equal to 

(x+i)u)(A<+^A") «_ 2 , 

i.e. to (*+ 1)(4) I (A*m_ 2 + A'‘w_i). 

If therefore we substitute J (A*a_2 + A*m_j), the mean difference in 
line with uj, for A*«_2 in Gauss’s formula, we make the formula 
very nearly correct to fifth differences, without having to calculate 
the actual coefficient of the fifth difference. The substitution there- 
fore greatly improves the accuracy of the formula. 

When, however, the substitution is made, it will be found to 
reproduce Bessel’s formula to the fourth mean difference. There- 
fore B essel’s formula to fourth mean d ifference s i s usually more 
accurate than Stirling’s t o the fourth differ ence. 

It irt'ay be shown similarly that Stirling’s formula to odd mean 
d ifferences is usually more accurate than BesseFs to the same order 
o f differences. 

The above demonstration is only approximate ; a strict investigation 
into the relative accuracy of central and advancing difference formulae 
requires rather more elaborate mathematical discussion. (See Whit- 
taker and Robinson, Calculus of Observations, p. 49; Lidstonc, T.F.A. 
vol. IX, pp. 246-57; Fraser, jf.I.A. vol. l, pp. 25-7; Sheppard, 
Proceedings of the London Mathematical Society, Series 2, vol. iv, pp. 
320-41 and vol. x, pp. 139-72.) 

Mr D. C. Fraser has given the following criteria summarizing 
the properties of interpolation formulae : 

(i) Formulae which proceed to constant differences are exact 
and are true for all values of n whether integral or fractional. 

(ii) Formulae which stop short of constant differences arc 
approximations. 
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(iii) Formulae which terminate with the same difference are 
identically equal, for they involve the same u’s. 

It should be noted that these rules are quite general; they apply 
to all formulae based on finite differences, not ending with mean 
differences. 

15 . Apart from the general superiority of central difference 
formulae certain of the formulae possess distinct advantages in 
special circumstances. For example, for th e bisection of an 
interval Bessel’s form is convenient, si nce the alternate te r ms a re 
zero. We have, at once, 

= H“o + “i) - i + ^*“o)] + tIs [i (‘^*“-2 + - ■ • • • 

Again, in using Everett’s formula for the subdivision of inter- 


vals the terms 

are such that they may be used twice : they occur 

both in an “x 

” expansion 

and in 

reverse order in the next 

expansion. An example w 

ill make this clear. 

Example 2 . 





Given x 

30 35 

40 

45 

SO 55 60 


771 862 

lOOI 

1224 

1572 2123 2983 

obtain values for for all 

integral values 

of X between x = 40 and 

X = 50. 





The difference table is 




X 


Au, 

A*u, 

A’u, A‘«, 

30 

771 






91 



3.3 

862 


48 




139 


36 

40 

tool 


84 

s 



223 


41 

45 

1224 


125 

37 



348 


78 

50 

1572 


203 

28 



551 


106 

55 

2123 


309 




860 



60 

2983 
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Employ Stirling’s formula to obtain successive approximations to 
(1-02125)®®, given 

(i-oi)®®= 1-64463; (i-02)®* = 2-69159: (1-03)“ = 4-38391; 

(1-015)“ = 2-10524; (1-025)“ = 3-43711. 

7. Find formulae true to third differences for the bisection of an_ 
interval 

(i) in terms of the two nearest values of the function and of 
differences of the functions ; 

(ii) in terms of values of the function only. 

Apply either formula to find ^ 3 S> given the values of at 20, 30, 40, 
50 to be 1313, 1727, 2392, 3493 respectively. 

8. Given the table 

a; 310 320 330 340 350 360 

log a; 2-4914 2-5052 2-5185 2-5315 2-5441 2-5563 

find the value of log 3375 by a central difference formula. 


9. Prove that if third differences are assumed to be constant 


M, = a:Mi-(- 

where y = i - a;. 


X (a;^— 1) 


AX + — 


y(y2-i) 




Apply this formula to find the values of Mj, to Wjj and Mj, to w,,, given 
that «o = 3oio, U5 = 271 o, 1110 = 2285, 1115=1860, iijo=i5^»o, «s!6=i5*o 
and «ao=i835. 


10. From the table of annual net premiums given below find the 
annual net premium at age 25 by means of Bessel’s formula : 

Age Annual net premiums 

20 -01427 

24 -01581 

28 -01772 

32 -01996 


11. Apply a central difference formula to find / (32), given 
/(25) = -2707, / (30) = -3027. /(35) = -3386, / (40) = -3794. 

12. Use Gauss’s interpolation formula to obtain the value of / (41), 
given /(3o) = 3678-2, /(35) = 299S-i. / (40) = 2400-1, 7(45) = 1876-2, 
/(5o) = 1416-3. 

Verify your result by using Lagrange’s formula over the same figures. 
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13. Prove the following formulae for the general case of unequal 
intervals : 

(i) «^ = Ma + vi HA««+ AmJ + M + 

+ ABL \ + A^W/) + • ■ • (Stirling) 

(ii) «x = i («a + “6) + i (^ + ^) + (A^«l + A^«a) 

+ J (^fiC + ABL) A\Ut + ■■■ (Bessel) 

and show how to obtain a general Everett form. 


14. Show that any central difference formula can be developed 
from Lagrange. Apply a central difference formula obtained thus to 
find /(3i), given that /(2) = 2-626; /(3)^3-454; /(4)=4-784 and 
/(5) = 6-986. 


15. Given Uu, «j, u^, Wj, «4, Ws (fifth differences constant), prove that 


where 


+ 


25 (<:-6) + 3 (a-c) 


256 


a = tio + u^-, 6 = c = M2 + M3. 


16. A series is formed by the division of the terms of the two series 
«, I 2 6 24 ... ;/! 

t'j, 4 20 120 840 ... f (n + 3)l 

Obtain an interpolated value for of the new series by a central 

difference formula and compare the result with the quotient of by 
in the component series. 


17. The following is a difference table written down in Woolhouse’s 
notation : 


“-I 

«_i 

«o 

•h 


a_8 


“i 






Cl 


ii;. 


If Oj — 1 («_,+«]) and Co — i (c-i +0)) show that Stirling’s formula 
(to fourth differences) can be expressed as 

u, = Uo + + Dx^, 


where A, B, C, D are functions of a^, b^, c,, d!o ot'ly* 
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18. Prove that in Woolhouse’s notation 

Mi = Mo + *(1) fll + *(2) *0 + (* + 0(8) + (* + 0(*) 

correct to fourth differences. 

19. Show that the sum of the terms of the series u_2> m_i, Mq, Ui, 
can be expressed in the following form 

i 4 uo + B8*tto + C8X. 

where and 8*«o denote the second and fourth central differences of 
u^; and find A, B and C. 

20. By splitting up the fraction of the form 



into partial fractions, show how to arrive at Ux in terms of known values 
of the function of which x occupies the central position. 

21. If tt* = ^Mo + BAMo + CA^_j + ..., i.e. a general expression for 
Ux in terms of central differences, prove by expressing all differences in 
terms of advancing differences of that 

Ml = «0 + ^(1) ^“0 + *(2) ^*“-1 + (* + l)(S) 

obtaining the general term in the expansion. 

22. Show that in the general divided difference interpolation formula 
any two successive terms can be reduced to a pair in Everett form. 
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INVERSE INTERPOLATION 

1 . When performing direct interpolation, values of y corresponding 
to various values of the argument x are given and we are required 
to find a value of the entry y corresponding to a value of x inter- 
mediate between the given values. If it is required to obtain an 
interpolated value of the argument corresponding to an intermediate 
value of the entry, the process adopted is called “inverse inter- 
polation”. In other words, for direct interpolation we assume a 
curve y = passing through the points {x,y) and estimate the 
value of y corresponding to some intermediate value x' : for inverse 
interpolation we have a similar curve but are required to find a 
value of X corresponding to a value y'. 

For certain functions we may obtain the result easily. If 
y = sina;, then a: = sin“^j; if y = A.’®, then x=yL if y = cF, then 
x = \og yjlog a. The required values of x can be calculated im- 
mediately in these examples. 

On the other hand, if the data are simply corresponding numerical 
values of * and y, all that we can write down is a formula such as 
Newton’s or Stirling’s: we must then endeavour to obtain a value 
for X by solving an equation. For example 

y = = (i + Ap Uo = «o + X(i) Amo + Xi^^Ahig + X(3) AX -I- . . . . 

If second differences may be assumed constant we have a 
quadratic equation which can be solved at once. Should this 
assumption be inadmissible, then we are faced with an equation 
of higher degree than the second and the solution of such an 
equation may be very laborious. In these circumstances we resort 
to approximate methods of solution of the equation. 

2. Consider the problem of reversion of series. If 

y — bx + cx^+dx^+ ... 

and we wish to obtain an approximate value for x in terms of y. 


F M A s 


6 
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we may write x=‘By + Cy* 4 - Dy* +... + /iy" + .... The coefficients 
B, C, D ... K ... may then be found by equating the coefficients 
of the various powers of x in the identity 

x = B (6a: + CA'^ 4 - ...) + C (i* + cA:“+ ...)^+ • 

If y is numerically less than unity and we use only n terms in 
the expansion for x in terms of y — i.e. we let 

X = By + Cy^ + . . . + Ky'' 

— we are neglecting only and higher powers. Since y is less 
than unity, the neglected terms will usually be small. 

If however _y = M^ = Mo + X{i)AMo + X(2)A‘^«(,+ ... +x-(„)A"Uo+ ... and 
we wish to find x from this equation, w e cannot w'ith equal safety 
neglect the termsX(,j^i)A"+iuQ+ ..., for these all contain x,x‘^,x^ .... 

It is thus seen that the problem of inverse interpolation, although 
analogous to that of reversion of series, involves considerable 
difficulties. The best method of approach is from a practical point 
of view. 

Given a problem in direct interpolation, the results obtained by 
the use of an interpolation formula are justified only on certain 
assumptions. Similarly, in interpolation by differences for an 
inverse function the results must be judged practically by the 
progression of the differences. It may be stated however that if 
interpolation to nth differences is accurate enough for / {x) it does 
not follow that the same number of differences will suffice or will 
be required for the inverse function. 

3 . The problem of inverse interpolation may be viewed in two 
ways. We may, by graphic or other indication, observe that the 
value of X which we require corresponding to some given value of 
lies in a certain narrow interval. Thus, if we are given the 
following table: 

Rate of interest 

per cent. 2^ 3 3J 4 4^ 5 

Annuity-value 8-7521 8-5302 8-3166 8-1109 T 9^^1 77217 

and we are asked to find the rate of interest for which the annuity- 
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value is 8-ooo, we may take the interval 4-4J as the interval (o, i) 
or ( — 4- i) and write down an interpolation formula, using only 

a quadratic or cubic function. As the interval and the values of 
the variable are small and the differences are rapidly decreasing, 
the solution of such an equation will, in general, give sufficiently 
satisfactory results. If the equation is a quadratic the solution will 
present no difficulty; for a cubic various methods are available, 
some of which are discussed later. 

4. Alternatively, we may exchange the dependent and independent 
variables. That is, given a table of y = u^ we may use the inverse 
function x~Vy. 

This method of interchanged variables is subject to very severe 
restrictions on the function u^. over the range of values used. In 
the first place, must be strictly monotonic — i.e. uniformly in- 
creasing or decreasing — over the range of values given and the 
unknown value of x, the value of x in all practical cases lying not 
outside, but within the given range and near the middle of it. If 
Uj, is not strictly monotonic, the inverse function becomes two^ 
valued at least — possibly many-valued — and hence cannot be 
repr’eseriled by a polynomial. In these circumstances the ordinary 
methods of finite difference interpolation are unsafe. 

In consequence, before this method of inverse interpolation can 
be attempted we must have some extraneous knowledge, graphical or 
otherwise, of the nature of the function Uj.. We must usually, in 
fact, be able to see roughly the position of x. These conditions 
being premised and being generally satisfied, the necessity for a 
sufficient number of values of and a small enough interval is 
naturally seen. 

5. The point here made is clearly brought out by a consideration 
of the following example. 

Example l. 

Obtain a value for x when = 19, given the following values : 

X o i 2 

Ux O 1 20 

6-a 



84 FINITE DIFFERENCES 

There would seem to be two possible methods at our disposal : 

(i) We might write down at once (if we think it safe to assume that 
second differences are constant) 

>- = U* = (l +A)*«o = Wo+*(1)A|<o + JC(2)AS; 
i.e. + 

so that 9Jc^ — 8 jc— 19 = 0, 

from which *= 1-964 ... or -1-075.... 

(ii) Since we have to find an interpolated value of x corresponding 
to a value of y we might treat y as the argument and a: as the entry. 
Let us write the data in the form 

y o 1 20 

= o 1 2 

and apply the Lagrange formula to calculate Ujg as if for direct inter- 
polation. 

We shall have 

Ojg _ I 2 

19. i8.(- i) r8. 1 .(- 19)''^( - i).20. 19 

or f jg = 2-8. 

It will be seen therefore that we have obtained two distinct sets 
of results. By adopting the first method x has the values 1 -964. . . 
or — 1-075... adopting the second method x has the unique 

value 2-8. It remains to examine the reasons for the difference and 
to ascertain which result, if any, is more likely to approximate to 
the true interpolated value or values. 

In the first method it will be apparent that we have taken a 
curve of parabolic form, y = a + bx + cx^, and have obtained 
values of x corresponding tojy=i9. This gives two values of x, 
one on each side of the vertex of the parabola. In applying the 
Lagrange formula inversely we have assumed that jc is a quad- 
ratic function of y and have given y a particular value (19) in the 
equation x = If we substitute the value of y corre- 

sponding to each value of x from the data, it is easily seen that 
a = o, 13=398/380 and y=— 18/380. The Lagrange equation is 
therefore 1 90X = 1 99^ - 9y*. 
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Now if the two curves 

y =» 9JC* — Sac 


and 


x = y - 

190"^ 


190 


be plotted on the same graph, it will be seen that they take different 
shapes, thus: 



Fig. I. 

On the curv'e v = 9a:® — 8a: the abscissae of the points whose 
ordinates are 19 are i'96... and — i-oy..., whereas on the other 
curve there is only one point for which the ordinate is 19, namely 
the point (2-8, 19). Unless therefore the two curves obtained from 
the data, (i) by treating x as the argument and (ii) by treating v as 
the argument, intersect at the required interpolated value, as for 
example at K in the above figure, the two methods are bound to 
give different results. 


6. Although there would seem to be three different and possible 
answers to the question above, we must be very careful before we 
draw any conclusions from the results. 

In the first place, we are not told, and have no right to assume, 
that is constant, and that consequently second difference 
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interpolation for y is sufficient. On the data as given the only 
conclusion would be that they are inadequate either for direct or 
inverse interpolation. 

Secondly, even if we may assume that is constant, we see 
that 

(fl) a portion of the curve over the given range, namely from 
jc = o to a: = I , is non-monotonic and one of the essential conditions 
set out in paragraph 4 is thereby infringed ; 

[b) notwithstanding the assumption that second differences may 
be used for direct interpolation, we are not justified in assuming 
that this order of differences is sufficient for inverse interpolation : 
we cannot therefore safely use Lagrange’s formula. 

If we make the assumption that second differences are constant, 
we cannot properly use the point (o, o) for inverse interpolation, 
as the given values do not indicate the form of the curve between 
x = o and * = i . By omitting the value (o, o) and using the values 
for y when a: = i , 2, 3 and 4, a more satisfactory interpolated value 
can be found. The student should attempt this by one of the 
methods described below. 

7. The formulae of direct interpolation are based on the properties 
of rational integral functions of the variable, and any formula which 
proceeds to nth differences gives exact results when applied to a 
rational integral function of the nth degree. By stopping short of 
nth differences the formula can, of course, be used to obtain 
approximate results, and the success of the interpolation depends 
on the magnitude of the terms omitted. Thus, if we use rth 
differences for a polynomial of the nth degree in x, the result is the 
exact value of terms up to and including the term in xT. The terms 
beyond xT are disregarded, and this can only be done legitimately 
if they are relatively unimportant. 

In questions of direct interpolation there is only one value of y, 
i.c. of u^, for a given value of x. There may be, however, more than 
one value of x for a given value of y. In fact, if ^ is a polynomial 
in X is a polynomial in y only when both functions are of the 
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first degree. In other cases the inverse function may be an infinite 
series or an irrational function. For example, in the Table of 
Mortality there is only one value of for a given value of 
(where represents the number of persons attaining exact age x in 
any year of time, and is the number of these who die before 
reaching age x + r). In the neighbourhood of the peak of the death 
curve, however, there will be two values of 4 within a short range 
of interpolation for a given value in d^. 

For these reasons the subject of inverse interpolation is more 
troublesome than that of direct interpolation, although it should 
always be remembered that the conditions attaching to the differ- 
ences of Uj. for direct interpolation are the same as those attaching 
to the differences of Vy for inverse interpolation. One principal 
condition is that within the range of interpolation there should be 
only one value of x corresponding to a given value of y. 

Let us consider the equation in Example i, namely 
jV= -8* -1-9*®, 

where the range of interpolation is from o to 2. The first point to 
note is that the function is not a good subject for direct interpola- 
tion except when the formula is applied to its fullest extent — the 
second degree. The reason is that the last term is the predominating 
term throughout the greater part of the range. 

In most instances, by altering the interval and reducing the 
range of interpolation, a function can be reduced to a good form 
for direct interpolation. Such a question as the following might 
be put; 

Given the function j = «= — 8x + 9.T®, for what intervals of 

X should Uj. be tabulated so that in any interval an interpolated 
value of y can be obtained by first difference interpolation with an 
error less than, say, 001? 

Put x = zla; 

8a 9a* 

then + 

i.e. = — Sax + qa®, 

a*An,= — 80 -(- 1 8* 9, 
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1 8, 

Suppose Vg to be tabulated for values of z at unit intervals. 
Then, for an interpolated value between Vg and 

«.+* = t'a + ^^Vg +it(t-i) Ahjg. 

If we take Vg + tAVg as the interpolated value, there is an error 
Ahjg and the maximum numerical value of i) is 

being the value when t is 

Therefore, by the conditions, 

(^ — i) ooi. 

But i 8/(2^ r8/8<z®< -OOI ; 


i.e. 

> 18000/8, 

i.e. 

> 2250, 

or 

a>^ 22 $ 0 , 


and the most convenient value for a is 50. 

We must therefore tabulate at intervals of 5’^ of unity, i.e. at 
intervals of -02. 

For example, ,, 

J = “x- -8 x + 9v‘‘. 


X 

2 


At'. 

I'lO 

55 

2-0900 

•2396 

I-I 2 

56 

2-3296 

-2468 

I-I 4 

57 

2-5764 

•2540 

I16 

58 

2-8304 

-2612 

i-i8 

59 

30916 

-2684 

1-20 

60 

3-3600 



Both for direct and inverse interpolation this table is better 
than one proceeding by larger intervals. It must not be assumed. 
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however, that a table which is good for direct interpolation is 
necessarily good for inverse interpolation: in particular, inverse 
interpolation always presents difficulties if Au^l^ is small. 

'8^ Practical methods of inverse interpolation. 

It is evident that the problem of inverse interpolation is the 
same as that of direct interpolation for unequal intervals. The 
methods of Lagrange or of divided differences can therefore be 
employed to obtain any intermediate value of x corresponding to 
a value of y, given a table of y = u^, by the use of the inverse re- 
lation x = Vy. Alternatively, we may treat the problem as one 
involving the approximate solution of an algebraic equation. Two 
of the methods often adopted in practice are given below. They are 
simple to apply and generally lead to satisfactory results. There 
are, however, certain objections to the methods — for example, it 
is difficult to ascertain the degree of accuracy which has been 
reached in the approximate answers — and for many purposes the 
most convenient and practical plans are probably Aitken’s methods 
of cross-means, to which reference is made in Chapter VUI. 

9 . Successive approximation. 

In the first place we obtain either by inspection or by a rough 
graph two values of x lying on either side of the required inter- 
polated value. (For example, a value for x when y is zero in 
the function y = x'^ — ^ + 2, i.e. a solution of the equation 
,v- — 4x-i-2 = o, lies between the values x=o and x=i.) We then 
choose a suitable origin and unit of differencing so that if x be 
the interpolated value and lies between two successive values of 
the argument, the interval will be small and x will be as near to 
the origin as possible. 

Suppose that the required value lies between o and 1 . 

The method proceeds as follows: 

Ux = U^ + xAjI^ + ^X (x — i) A^Uq+Ix (x~ i) (x-2) A^u^+ .... 

Since x is small, a first approximation ((Xj) will be obtained by 
neglecting terms involving second and higher differences of Uq. 
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«^ = «o + «! Amq approximately, 

i.e. aj = (u^ — Uq)/A«o, first approximation. 

Again, neglecting third and higher differences, we may write 

«* = «o + («i - i) AXi 

where Oj is a second approximation and is therefore not very 
different from otj. This gives 


— M, 


Similarly 


flt, = -7 . - - , second approximation. 


*3=- 




A«o+H* 2- J)^X + i(«2-i)(a2-2) A*Uo’ 

and 80 on. approximation. 


10. Ellmiiiatioii of third dlfferencea. 

We have, as far as third differences, by expressing in terms of 

Uq, Auq, . . . , 

a, = + xAuo + (x - I ) AX + s* (* - i ) (* “ 2 ) AX- 

Also, in terms of u^, Auj, ..., 
a^ = Ui + (x- i) Auj + J (x— i) (x-z) AX 

+ H^- 0 (x-2) (jf-3) AX- 

If now we ignore the terms containing third differences and 
multiply both sides of the first equation by 3 — a and both sides 
of the second equation by a (where a is an approximation to the 
required value, found by inspection or otherwise) and add, a new 
quadratic equation in x will be formed. The error involved in 
ignoring the third differences will be small, since 

(x- i) (*- 2 ) (3 -a) AX + i« (*- i) (*- 2 ) (*- 3 ) 

will be small provided that AX and ^X are not very different. 

11. The following question is solved by both these methods: 

Example 2 , 

Find the value of * for which y is i8,6oo, given 

* 52 53 54 55 56 

y 19,231 18,868 18,519 18,182 17.855 
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Changing the origin, the difference table is 


y 

0 19,231 

1 18,868 

2 18,519 

3 18,182 


Aj 

A*y 

-363 

H 

-349 

12 

-337 

10 


-327 


4 17.855 

By the ordinary advancing difference formula 


-2 

-2 


18,600= 19,231 —363^; + 


14J: (x—i) 
2! 


2 X (x-i) (a -2) 

3! 


where the value of x is required corresponding to the value 18,600 of y. 
(i) Successive approximation. 

Since x is small, a first approximation will be 


19,231-18.600 _ 

= or 17383-- 


Including the next term. 


aj = 


631 


363-7K-1) 
631 


Similarly, 

“ 3 b 3 - 7 («* - i) + i («a - I ) («9 - 2) 


or 17634.... 

or 17646... 


The required result is therefore 52 + 17646... =537646..., where, 
it should be noted, the last digit is uncertain. 

(ii) Elimination of third differences. 

We have 

i8,6oo = Mj = (i+A)»u„=i9,23i-363x + 7*(*-i)-|*(*-i)(*-2) 
and 

i8,6oo = «, = (i +A)*-> = 18,868 -349 (*- i) + 6 (*- i) (x-z) 

-i (*-i)(*-2 )(*-3). 
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By inspection a rough value of the interpolated value is i-ys, allowing 
for the change of origin. 

If, therefore, we multiply the two equations by (3 — 1’75) and f75 
respectively, and add, we may neglect the fourth term. The factors 
being 1-25 and 1-75 we can use 5 and 7: we thus obtain 

12 X i8,6oo = 5 [19,231 -363JC + 7* (*- i)] 

+ 7 [18,868-349 (jc- i) + 6 (*- i) (*-2)] 

or 223,200 = 230,758 — 4419^ + 77*^, 

i.e. 77** -4419x4- 7558 = 0. 

Solving the quadratic, the value required is x= 1-7646..., which 
agrees with the value of in method (i) to four decimal places.. 


12. In Example 2 the advancing difference formula has been used ; 
in practice, however, it is more usual for many reasons to employ 
a central difference formula as the basic equation. Central differ- 
ence formulae are more convenient because the coefficients are 
smaller and converge more quickly; this in itself is a decided 
advantage. Having placed x in the middle interval the first approxi- 
mation is generally much more accurate (thus, in Example 2, the 
use of Stirling’s formula gives 

aj= I 4-268/356 

= i-753)- 

Also, X is smaller, and the maximum number of significant digits 
which the data will yield may mean an additional place. 

Consider the following example. 

Example 3. 

Find the root of the equation x® — 9*— 14 = 0 which lies between 
3 and 4. 

Let>' = x® — 9x- 14. Then we have, by actual calculation, 

X 3-0 3-2 3-4 3-6 38 4-0 

y —14-000 —10-032 —5-296 -256 6-672 14-000 
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The difference table is 


X 

y 

^y 

A‘y 

A* 3 - 

3-0 

— 14*000 

3-968 



3-2 

— 10-032 

4-736 

•768 

-048 

3-4 

- 5-296 

5 -S 52 

•816 

•048 

3-6 

•256 

6-416 

•864 

-048 

3-8 

6*672 

7-328 

•912 


4-0 

14*000 





Taking the origin at 3-6 and using Stirling’s formula: 

AUfl + A«_, ^ X (x'^-i) A®u_, + A®«_, 

= + -- + - A-a_,+ V 

2 2 6 2 

the interval of differencing being o-2 ; 

i.e. 0 = -256 + 5-984^: + -432*- + -ooSx (.r-- i). 

The cubic equation can be solved by successive approximation, or 
we can repeat Stirling’s formula for the next value of and adopt the 
alternative method outlined above. 

If the first of these methods be adopted, it will be found that 
successive approximations to the value of x are —-04278, —-042913, 
— -042971. From the last we obtain as the required solution 

3-6 - ( 042971 X -2) or 3-5914058, 

which is correct to six decimal places, the seventh being nearer to 7. 

If we choose our origin at the point (x, j) and the value of the inter- 
polated value is * -(- a, then, w hen a lies between — j and ^ it is advan- 
ta geous to us e Stirling’s forn iula. If a lies_tytween |~and | Bessel’s 
formula should be applied . (Whittaker and Robinson, Interpolation, 

p. 65:1 

13 . In general, if has n significant digits we cannot rely on 
more than n significant digits in x. Even then the last digit is 
doubtful and may in fact be misleading. In Example 3 above w-e 
have been able to obtain the fifth significant digit in -042971 from 
a four-figure value of Aj only because y and Ay happen quite 
unusually to be exact and not rounded off. 
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14. The method of successive approximation is a convenient 
method for ordinary use. If we want a result that can be obtained 
to the required degree of accuracy by taking out differences as 
far as A®«, or if the curve is a cubic, the elimination method will 
give a satisfactory answer. The disadvantage of this process is that 
we cannot approach our interpolated value by steps as is done in 
the method of successive approximation. Moreover when fourth 
and higher differences are not negligible the elimination method 
breaks down. 


15. The general investigation of the accuracy of finite difference 
methods of approximation is a problem in direct interpolation, 
and has been dealt with previously. In dealing with the subject 
of successive approximation, however, i t is of interest t o includ e 
in^thejjjg^s ent chapter an elementary illustration pf the fact t hat 
in ceruin circumstances^ hotter in terp olation ^n be obtained ^ 
neglecting higher differences than by retaining them. 

For example, if we have a third difference curve, then 

= Uo + X(i) A«o + X( 2 ) A®Mo+ A®Mo exactly. 

The error (a) in taking two terms is X( 2 )A®«o + Xi 3 )A®«o, 
and (P) in taking three terms is X( 3 )A®m„. 

(a) may be expressed as 

X (x — I ) (,v — 2 ) I 


[_X — 2 A®Mo 


A»k. 


0 * 


and (/?) as 


x( x-l)(x-2 ) 

6 ^ 


Then, ignoring the sign of A**/^, which will 

be the same for both (a) and (jS), (a) will be less than (^) if 

3 

is negative and numerically less than 2. 
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In these circumstances the error made by retaining first differ- 
ences only is less than that made in continuing to second differences. 
As an illustration consider the function ** -f 5^: + 50. 

It is easily seen that u„ = 5o; Auo=6; A^Mo = 6; A®Mq= 6. If 

jc is for example, which is negative and less 

X 2 ^ Hq 


than 2. 


(tt) is therefore less than (^). 

This can be otherwise shown by finding the values of the errors. 


"} = 5 Inexactly. 

Also = ( I + A)^ Mq 

«o + JAuo=54 = 5i||-. 

Uo -t- 1 Amo - 52 AX = 50 T 6 = 5<>l i . 


5i||-5i-H=M. («) 

(^) 


(a) is less than (,8), so that the approximation to first differences 
is better than that to second differences. 


16. In conclusion it should be emphasized that the accuracy of 
the result depends greatly on the fineness of the interval of 
tabulation. In cases of doubt it may be desirable to halve the 
interval before applying the process of inverse interpolation. This 
will reduce first differences by about one-half, second differences 
in the ratio of about i : 4, third differences in the ratio of about 
I ; 8, and so on. The comparative effect of higher differences is 
therefore much reduced. 


EXAMPLES 5 

1. Given Mi=o, M2= 1 12, U3 = 287, 1/5 = 612, find «4. Using Uj, «, 
and 114, find a value for x when Kj, = 270. 

2. The following values of u,. are given; «io = S44, Uj5=i227, 
“ao = *77S- Fittd, correct to one decimal place, the value of jc for which 
«,= 1000. 
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3. Having given log 1=0, Iog2 = -3oi03, log 3 = -477i2 and 
log 4 = -60206, find the number whose logarithm is -30500 : 

(i) by expressing log x in terms of log i and its differences and 
solving for jc; 

(ii) by using Lagrange’s formula applied inversely. 

Explain the nature of the assumptions in each case. 

4. Apply Lagrange’s formula inversely to find to one decimal place 
the age for which the annuity -value is 13-6, given the following table: 

Age jc 30 35 40 45 50 

Annuity-value at 4^ per cent, 15-9 14-9 14-1 13-3 12-5 

5. /(o) = i 6-35, /(5) = 14-88, /(io)=i3-S9 , /(i5)=i 2-46. Find jc 
when / (jc) = 14-00. 

6. Given the following table oif(x) : 

/(o) = 2i7, /(I) = I 40 , /(2) = 23, /(3)=-6, 

find approximately the value of x for which the function is zero. 

7. The following values of / (x) are given : 

/(to) = i754, /(i5) = 2648, /(2o) = 3564. 

Find, correct to one decimal place, the value of x for which / (x) = 3000. 

8. Given four values of a function 1%, Mj, Mj, 1/3, show how to calculate 
an approximate value for x from the equation 

= “0 + *Atto + — TT ' + ; t ^ “0 

by obtaining a quadratic equation in place of a cubic. 

Use the method to find x when Mj.= i-o 5, given «i.3= 1-0000, 
«!., = 1-0323, tti.2= 1-0627, u,.3= 1-0914. 

9. Given that (1-20)^=1-728, (1-21)® = 1-772, (1-22)®= 1-816, 

(1-23)®= 1-861, (1-24)^=1-907, explain carefully how to find the real 
root of the equation x® + x — 3=0 by a method of inverse interpolation. 
What method would you adopt in practice? Obtain a value for the 
root to four decimal places. 

10. The following table is available : 

Age X 44 45 46 47 

at 4J per cent. i3'40 1316 12-93 12-68 

Find, to two decimal places, the age corresponding to an annuity of 
13-00. 
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11. Find, to two decimal places, the real root of the equation 

+ — 5=0 

by means of divided differences applied inversely, using the values of 
the expression when * = o, i, 2 and 3. 

What is the reason for the poor result obtained in this case? 

(The true solution is *= 1-516 approximately.) 

12. The equation jc® — 6* — 11=0 has a root between 3 and 4. 
Obtain it by inverse interpolation correct to three places of decimals. 

13. The formula for the value of an annuity-certain for « years at 
rate per cent, i is given by 

I — 

fls] = — ; — , where o = (i -I- 1')~*. 

Given the following table, obtain to three decimal places the rate per 
cent, for which is 14; 

Rate per cent. 3 3^ 4 4i 

‘*201 14-8775 14-2124 13-5903 13-0079 

14. Solve the equation *= 10 logj^ x, given the following data: 

Arguments: 135 1-36 1-37 1-38 

logx ... -1303 -1335 -1367 -1399 

15. Apply Lagrange’s formula (inversely) to find a root of the 
equation «* = o, when -30, 1*34= -13, «3g = 3, W42= 18. 


P M A S 


7 



CHAPTER VI 


SUMMATION 

1. Certain series whose law is given or of which there are sufficient 
terms to enable the law to be assumed may be summed by the 
methods of finite differences. 

By definition we have 

f{a + h)-f{a) = \f{a) = ii>{a), say, 
f{a + ^h)-f{a + h)^^f{a + h)^<f>{a + h), 


f (a + nh) -f(a + n— ih) = A/(fl + n — i/i) = (/> {a + n — ih)) 
f {a + nh)-f {a) = <f>(a) + if>{a + h)+ ...+(f> (a + n-ih). 

If therefore / (a:) is the function whose first difference is (f) (x) 
we can find the sum of any number of terms of the series whose 
general term is <f> (x) in terms of values of / (x), for any given 
interval of differencing. 

By a suitable change of origin and scale we can make the interval 
of differencing unity and the first term of the algebraic series under 
consideration ^ (i). On putting a and h each =i, the required 
relation then becomes 

S<^(^)=/(n+ !)-/(!). 


The expression /(n+ i) -/ (i) is sometimes written in the form 

This represents the process of substituting n + 1 and i for x 
successively in/ (x) and deducting the second result from the first. 


Note. It should be remembered that 2 <f> (*) leads to 

a 

lb 


and not to 


/(*) 


/W 


6+1 
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2 . Although any function of x can be differenced, there is only a 
limited number of functions which are the first differences of other 
explicit functions. The principal forms of such functions are given 
below. 

(i) It can be easily seen that, since Aa-' = (a— i) 



.'. is the function whose difference is a®. 

a~ I 

ft 

We can therefore find H Uj. by the method above if is of the 
1 

form kc^. 

(ii) The relations and Ax(„,) = enable the 

sum of any series whose nth term can be expressed in the factorial 
notation to be summed immediately. 


Example i. 

Sum to n terms the series whose xth term is x (x-i) {x-z). 
Now (x— i) (.V — 2) = sc'®*, 

and since Ax*** = 


2 x'*' = 

I 



n+l 

1 


+ 1 ) ” («- 1 ) ('*- 2 ). 


since the product of four successive terms of which i is the first includes 
o and is therefore obviously zero. 

If the interval of differencing does not happen to be unity the identity 
A**’"* may be applied, but care must be taken in doing so. 

Here is the difference of the function x^™^jmh, so that in summing 
a series whose A:th term is, for example, zx (zx — z) (2*— 4), we must 
divide 2* (2x: — 2) (2* - 4) (2x— 6) by A = 2 as well as by m = 4 before 
taking the limits n + i and i. 

(iii) Any polynomial P„ (x) of the nth degree can be expressed 
by an interpolation formula ending with A^m^. in which each coeffi- 
cient is of the form X(„,, or \ [x<,„) + (x-l- 1)(„,)]. 


7-2 
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Since *(r) = (^+ l)<r+l)-^(r+D 

it follows that 

Each term of the formula can therefore be stimmed at sight. 
Thus, the advancing difference formula 

“a + JCd) + JC (2) + X(3) + . . . 

yields j:(i)K„+X(2)AM„+;»:(s)A^Mo + a;(4,ASa^+ .... 

The Gauss forward formula in its simple form is 

«x=“o+'*(1)AMo + ^(2)A^M_1 + (*+ i)(3)AVi+ ... 
and therefore 

= ^<i)“o + ^(2) AUfl + ^(3) A^_1 + (a: + 1 )(4) A»«_I + . . .. 
Stirling’s formula may be written in Sheppard’s notation as 

= Ua + X(i)p8Uo + H(^ + I ) (2) + ^(2)] SX + (* + I ) (3) fiSX + • • ■ ; 

whence 

S«x = A!u)Mo + *(2)M^“o+i [(*+ l)(3) + *(3)] + (x i),^) fJ^^UQ+ .... 

Example 2. 

Find the sum of n terms of the series o, lo, 33, 150, ..., given 

that third differences are constant. 

By taking out the differences it is seen that Au= 10, A*tt= 13 and 
A»u = 8. 

The advancing difference formula gives 

tt^ = o + iox(i)+i3X(2,+8x(3). 

n-l 

If the series is «g, Uj, ... ... u„_^, then we re(;juire E u^. 

0 

n— 1 n—l 

S E [ioX(j) + 13X(2) + 8 x( 3)] 

0 0 

I0X(„+13X(3) + 8*(4)J^ 

= io«(3)+i3n(3) + 8«(4) 

= Jn (n- 1) (2n* + 3n+ 16) On reduction. 
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Alternatively, we may continue the table backwards to obtain 
the differences required for the central difference formulae. 
Stirling’s formula then gives 

2w^ = 7 - 5 X(j) + 5 .i [(^c+i)( 3 )+*{ 3 )] + 8(jc + i)( 4 , 
n— 1 

and S Mi = 7-5n(2,+ 2-5 [(«+ i)(3) + n(3)] + 8 (n+ i),*) 

0 

= ^n (n — i) (2«*+3n+ i6) as before. 

Should the series be given in the form Mj, u ^, ... u ^. ... then 

= Ml + (* - i)(i) A mi + (oc - 1)(*) A^Mi + (j; - i)(3) A^Mj. 

.-. SM^ = (A:-i)(i)Mi + (^-i)(j)A«i + (jc-i)(3)A^ + (a;-i)(4,A®Mi 

and 


1 


I 


(x-l)(nUi + (v 


I ) (2) Amj + (ar - 1 ) ,3 ) A^Mi + (* - i ) (4 , A»Mi 


] 


n+1 

1 


= nu)Mi + «(.2) Aui + n(3, A^Mj + n(4) A^Mj, 

which, on substituting for Aui, A^Mi and A®Mi, gives the result 
obtained above. 


3 . Since Av<"”^ = the scries whose .tth term is of the 

form [(jc+ i) (Ar + 2) ... + can be summed immediately. 


Example 3 . 

Sum to n terms the series whose arth term is 


(x + i)(a: + 2 ) (* + 3 )’ 




\{x+l){x + 2) 

I 

I (* + l)(* + 2 ) (* + 3 ) 


(v + l)(x + 2) {X + JY 

r I 

2 [(3:+i)(v + 2 )J, 

I l1 

(n + 2)(n + 3) 2.3j‘ 


4 . It is worthy of remark that the rules for the summation of series 
of the types given in paragraphs 2 (ii) and 3, as given in the text- 
books on Algebra, are precisely the same as those stated above, and 
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are based on the same principle. For example, for a series whose 
nth term is 


{a + nb) {a + n+\b) {a + n + zb) ... {a + n + r- ib), 

the finite difference method is simply to write this in factorial 
form, with interval of differencing b, and then to proceed on the 
lines laid down, thus: 


n n 


Su„ = i;(a + n + r— 1 
1 1 


^ L Mr+i) 1 


(o + n + rZ>y''+^' — (fl + rty+i* 


(a + n + 'rb)u^ 
b(r+i) 


constant. 


This, of course, produces the same result as is given by the 
algebraic rule: “Write down the nth term, affix the next factor at 
the end, divide by the number of factors thus increased and by the 
common difference, and add a constant” (see Hall and Knight, 
Higher Algebra, p. 314). 

For the series whose nth term is the reciprocal of the one above 
the inverse factorial is used, and a similar result is obtained. 


5. It sometimes happens that on taking out successive differences 
of a series a stage is reached where a particular set of differences 
forms a geometrical progression. In that event the series can be 
considered as consisting of two separate series, (i) a series whose 
general term is a + + cx* + . . . + (a rational integral function 

of x), and (ii) a geometrical progression. 

Suppose, for example, that second differences are in geometrical 
progression with common ratio r. Then 
«j = a+6x4-cr*. 

For AUj. = 6 + c (r- i) r®, 

and = c (r— x)® r® = ^, 

where A = c (r — i )®. 

It follows that for nth differences 
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Example 4. 

Sum to n terms the series 1, 6, 11, 18, 31, 58, 115, .... 
The difference table is 


I 
6 
1 1 
18 

31 

58 

115 


Third differences are in G.P. with common 
ratio 2. 

A*u^ A*u, Assume therefore that 


5 

5 

7 

13 

27 

57 


o 


6 

14 

30 


+ da®, 

A«j. = i + 2<r*<** + d2® (2 — 1) 

2 =6 + 2«*'* +d2®, 

= 2C + da®, 

4 A^Uj. = da®. 

g Inserting the differences for the value 
Jc = 1 , we have 

16 2 = A%j = 2d; o = A“aj =2c + 2d; 

5 = Auj = /) + ac + ad. 

From these equations we find easily that 
d=i; c=-i; 6 = 5. 


Putting jc=i in a, = a + 6**’' 4 - + da®, we have 
I = a, = a 4- 4- ad, 
whence a = i—b — 2d= - 6; 


^Ux = 
1 


aj= — 64-5****-^:*^’ + 2*; 

r 111 5'*'*’'* 2® 

-6jc'‘> 4- h 

2 32-1 


= -6(w 4- iy»4-64- ^ ^”±^)'^’ _^!!±ij!!’+2"+>-2, 


since all the factorials except the first vanish for the lower limit. 
This simplifies to 


2"+^ — 2 — ^ (aw® — I5n4- 19). 


Alternatively, we may proceed thus : 

A®a, = d2®. 

Deduct da* from and difference the function u^ — dz* in the 
usual way. 

By giving * the values i, 2, 3, 4 in succession it is easily seen that 
a^-da* takes the values - i, 2, 3, 2 respectively. On forming a differ- 
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ence table we find that the leading differences are 


AK-d2“) = 3, -2. 

Then tt, = 2“= - 1 + 3 (a; - i),,, - 2 (* - 

so that 

n r “In+t 

2*-(*-i),„ + 3(*-i),a,-2(*-i),3, 

1 L Ji 

= 2"+^ - 2 - «(i, + 3«(2) - 2«,s,, 

which, on simplification, gives the same result as that above. 

Note. In the methods above we have called the terms Wj, u^, Uj ... 
and have used the property = We may shorten the work 

slightly by beginning with x = o (so that all the factorials vanish at the 
lower limit) and using instead of a:*'*. Further, the two methods 
give the same result without transformation. It will be instructive for 
the student to rework the problem on these lines. 

6. The form Summation by parts. 

When the general term of the series is the product of two 

n 

functions of x and the value of each of the summations Uj. and 

1 

n 

S Dj is known, a method known as “summation by parts” can be 
1 

adopted. 

We have A {U^V,)=^ 

= U,^,AV,+ V,AU,-, 

.-. V,AU, = AiU,V,)-U,,,AV,; 

It follows that when the function u^v^. can be put in the form 

n 

V^AUj. the summation can be performed at once if S [U^_^iAV^] 

1 

can be evaluated (but not otherwise). 

Note. For extensions of the formulae for summation by parts see 
Chapter VII. 
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Example 5. 

Find the sum of the series a + 2a* + 3a® + 4a* + . . . to n terms. The 
terms are successive values of the function y = xa^, and since 


A 



we may write * for and for in the relation above. 


a— I 


.-. 2[jca*]=2 

1 1 






n+l 

1 


X 

a — i 


X l'2-i 
n+l n r /7X+I n 


since Ax—i, 


a - 1 


n+l 


a®+l 




n+l 


fjn+l 


a a 


<n+Z 


n+i) — :o + 

' ^ V m \m 


a-i a-1 (a-i)- (a-i)® 


7 . The result in paragraph i, namely, that ^<f(x)~f(n + i)-f(i), 

1 

where A/ (a:) = (x), can be obtained by the use of the operator E. 

We have 4 > {x) = <f) {1) + <f> {2) + . . . + 4 > {n) 

1 

= (i+E+ii- + ... + £"-i)^(i) 

E^— i 


^(i) 


E-i 

£■"-1 


<A(i) = A/(i) = (£-i)/(i). 

=/(n+i)-/(i). 


since 



io6 


FINITE DIFFERENCES 


n _ j 

Thus the operator S is equivalent to the operator , and 

1 h — i 

— I ” 

we may safely substitute ^ — — for S in any series of operations. 

» _ 2 

Again, since £”1/^ = the identity S — Mi can be 

expressed as 

" 1 

2“x = £r3^(“n+l-«l) 

8. The relation between the operators 'L and A. 

It has been seen that if 

A/(x) = ^(x) 

then / (jc) = (x), 

where the summation is performed between certain limits. 

If therefore we omit the limits we may say that with certain 
reservations summation is the inverse process to differencing. 

Consequently (x)= A/ (x) = AX<f> (x), 

so that AEs 1, 

i.e. E = i=A-C 

A 

Now although AE = i it does not follow that SA = i , for we 
shall obtain the same result by differencing /(x) + c, where c is 
a constant, as by differencing /(x) alone. 

Thus A [/ (x) + c] = ^ (x) ; 

/. /(x) + c = A-i.^(x) = E<^(x) = EA/(x), 

so that EA^i. 

The symbol E may be and often is used in place of the inverse 
symbol A~^ provided that it be remembered that E (or A~i) and 
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A are not commutative, and that in the indefinite finite integral we 
must include, or imagine, an arbitrary constant which disappears 
in the definite integral. 

Thus, the process of summation in finite differences is similar to 
the corresponding process in the integral calculus and the relations 
between the symbols are analogous to those existing between the 
symbols of differentiation and integration. As a result, finite 
difference summation is often referred to as finite integration. 

n 

is said to be the indefinite finite integral of ; S the definite 

1 

finite integral ; and a function that can be integrated, such as a*, to 
be “immediately integrable”. 


9. Other uses of the symbol II. 

One of the commonest functions in the theory of life con- 
tingencies is the expression obtained by multiplying (the number 
attaining age x) by the interest factor r*. This product is denoted 
by Djf, and the connection between certain functions dependent 
on is indicated thus : 

= S, = 2;n^. 

Here 2 denotes summation from age x to the end of the mortality 
table, it being understood that values beyond the end of the table 
are zero. 

In point of fact, the correct way of showing the relation between 
the functions D and N, etc., is 

t~o 

where x is fixed and t is the variable. 

In modern mathematical works it is now usual to use the notations 

CO n 

X , S etc., where the variable t is specified for the lower limit only. 

b 

The still shorter form S is often used where there is no doubt about 

a 

the variable. 
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(i) When S is used in the special sense 

= + + + to the end of the table; 

AS«x = (“x+i + “*+a + “r+3 +•••)-(«* + “*+i + ««+i + • • ■) 

so that here AS = — i. 

(ii) If, however, S is specially defined so that Su^. indicates 
a summation beginning with «j and ending with the last term 
preceding u^,, then 

Su^ = «i4-M2 + U8+... + Ux_i; 

ASu^=(«i + «2 + tt3+... + «x)-(“l + «2 + “3+--- + «x-l) 

= Wx. 

In these circumstances therefore AS = i . 

(iii) Again, in algebraic series, S is often used loosely to indicate 
the sum of the first x terms of a series, thus : 

Su* = «i + U2+...+Mx; 

ASMx = (tti + « 2 +-.- + M*+l)-(Kl + «2+..- + «x) 

= «x+l 

= Eu^, 

whence AS = E. 

These illustrations serve to show that great care must be ex- 
ercised in introducing S into any formula. The sense in which it 
is to be used should be clearly defined in every instance : the safest 
course is always to state the limits where possible. 


10. Application of the relation between S and A. 

By treating the operator S as equivalent to A“^, the method of 
separation of symbols can be employed for the solution of problems. 
For example, a convenient formula for the evaluation of 2a*«x can 
be evolved by which the necessity for the continued application of 
summation by parts can be obviated. 


Example 6. 

Prove that 




-Ix- 


aA 


a^A® a®A* 


a—\{ a-i (a— i)* (a-i)* 
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Now = «“ w* 

and Sa“Ma. = omitting the arbitrary constant 

a*u^. 

Let (j) (^Ej = A^ + A-^E -x- A.2E“ + AfE^ + . 

Then the (r + i)th term in <f> (E) is A^E^a^u^. 

I.e. ^,a*+X+r = «*- 4 ,aX+r 

= a^Ara'E^u^ 

= a^A^(aE)’' t/y, 

.-. <f> {E) a^u^, = d‘<f> {aE) u^, 

so that if <l> (£) is the operation (E— 1)'* 

(£ - i)-^ a^Uj, = a=‘ (oE - 1 )-i u ^. ; 


'Ea’^u^ — a* {aE — 1)“* 


= a^ (fl + aA- i)“* Uj. 


» / ^ 1 r “‘i 

:a*(a-i)-i I 4- — 
a — 


a — I 

omitting the arbitrary constant. 


aA 

I h 


aA_ 
a 2 A 2 


a- 1 (a-i)“ 


Note. An alternative (and simpler) proof of this formula can be 
obtained by the use of the expansion of Eu^Vj. in a series. See Chap. VII, 
paragraph i2. 


Example 7. 

Apply the above formula to evaluate T3® (*® + x^ + *+ i). 

A«i = A (x® + x“ + x+i) = 3X* + 5* + 3, 

A*«a, = A^ (a:* + x* + x+ i) = 6 x + 8 , 

and A*«i = A'' (**+x^ + *+ i) = 6; 


23 ® (x* + x® + *+ i) 


- JL 
’ 3-1 


«. - A«x + 7 — ^ ^ 


( 3 - 1 ) * ( 3 - 1 )* * ( 3 -i)»‘ 

= ^- [Ar* + x* + *+i - | (3** + 5x + 3) + J( 64 ( + 8)— 1^.6]+c 


^ ( 4 X*-i 4 x* + 28 x- 23 )+c. 



no 
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It is often of advantage to set out the rational integral function 
in factorial or binomial notation; the successive differences can 
l^en be obtained with little difficulty. 

Thus, by the method given in Chap. I, paragraph 22, 

j[:® + ** + jc+ I + + 1, 

so that Au^, = 33c< *) + 8a:( + 3 , 

A2u^ = 63c<« + 8. 

A»Uj = 6, 

and, by adopting the formula for we have easily that 

(jc* + jc* + a:+ 1) = ^ (4Jc<®> + i8a;<'> —23) + ^. 

O 


To express — 2Jf^^' 4 - i8jc**’ — 23 in the form 

+ bx^ + cx + d, 

we use the method of detached coefficients applied inversely : 


4 

o 

4 

o 

4 

o 


— 2 
8 


— 10 
. 4 J 

-14 


18 
— 10 

28 


-23 


Hence 


^3'^ + + i) = 7r (4^^®— 14^:*+ 28^ — 23) 4- C, 

O 


as before. 


11. The following examples are instructive. 

Example 8. 


1 *(* 4 -i) 


3“"- 


Evaluate 
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„ 1^J3 L_^ 

■* *(* + i)3“ jjK x+ij 3* 


^31 ^ 3 _. 

*3* JC+ 1 3*^* 


= - 3 A 




I I 

^ 3 "J 


n+l 

1 


_ II 

Example 9. 

Show that the general term of the recurring series 
Up + «!:« + ^2 Mr . 

for which the scale of relation is i ~px — qx-, is Aa"-^ Bb^, where a, b 
are functions of p, q and A, B are constants. 

Since i —px-qx‘‘ is the scale of relation, 



Un-pu„-i-qu„_i = o\ 

i,e. 

u„ -pE-^u„ - qE-hi„ = 0, 

or 

(i-pE-^-qE--) u„ = o. 

Therefore 

( I - aE~^) ( I — bE-^} I/,, = 0 

if 

a + b=p and ab= —q. 


This will be true if either 


(i— a/r~‘)«„ or {i —bE~^) u„ — o, 
i.e. if ii„ — au„_i or = o. 

Now if u„—au^_,-o, 

then ii„ = au„_i, 

and the series is a geometrical progression with common ratio a. The 
general term of the “a” series is therefore Aa”. 

Similarly the general term of the "b" series is Bb'\ where A and B 
are constants. But if a new series be formed by the addition of these 
two progressions the relationship will hold good for this new series. In 
other words the most general solution is 

u„ = Aa” + Bb'', 

where we may give A and B any values, but a and b must satisfy the 

equations . , , 

^ a+b=p and ab= —q. 
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Example 10. 

If fourth and higher differences are ignored, prove that the sum of n 
successive terms of a function, of which Uq is the central term, is 

n«o + («* - n) A^_i, 

where n is odd. 

Since Uq is the central term, it will be convenient to use a central 
difference formula. 

Gauss’s forward formula gives 


Ux = Uo + xAUf^ + {x (*- 1 ) + J (x+ i) a: (jc- i) A^«_j 

= «(, + *'i*Auo + Jje***A^_j + J (ar+ i)<®> A®tt_] ; 

Zkj. = C+ Atto+ J**®* A^_i + ^ (a;+ i)'** A®k_p 

On summation between the limits — J («— i) and ^ (n— i) the co- 
efficients of Aug and A®m_j will cancel, and we shall have 


l(n-l) 

-l(n-l) 


C + xUf) + Auo 4- A*«_i 4- ^ 4- 1)*** A®u_i 


i(n41) 
-J(n' 1) 


n 4- I 
2 



’«4-I«-l« — 3 

f n~i n + 1 

1±3X 

2 ' 2 ' 2 ' 

{ 2 ' 2 

2 /_ 


A*tt_j 


= nuo + ^(ft^-n) A^_i. 


12. The ose of symbols of operation in the summation of alge- 
braic series. 

Many forms of algebraic series which at first sight do not seem 
to lend themselves to summation by the method of separation of 
symbols can in fact be summed very simply by that method. 

For example, if c, be the coefficient of x’’ in the expansion of 
(i +x)”, where « is a positive integer, the sum of the series whose 
rth term is (— i)'"”* [a-(r— i)] may be written down almost 

at sight. The series is 

Coa-Ci(a-i)+C2(a-2)-... 

i.e. CqU — CiE~^a 4- Cg£~^a — . . . = (i — E~^)” a 

= A"£-»a 
= A” (a — n) 

= 0 , for all values of a. 
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Again, consider the well-known series 
^ fi 52 

X AI+I X + 2 
This may be written as 

(cq-c^E + c^E^- ...) x-^ 

= (i —E)” 

= (-i)" 

nj 

.r (x-l- 1 ) (jc + z) ... (.r-l-n)' 

The following examples are illustrative of the method. 

Example 11 . 

Evaluate 

f « njn-i) 

jf+i (x+i){jc + 2) (x+ i) (xH-2) (x + 3) ■■■ (a:-(- I) ... (Ar + w+ i)' 

This series is +« (n— i) ... + (- i)" w! 

= x<-ii + ", Ax<-i' + A"x'-« 

1 ! 2 ! » 1 

= (i +A)'' x'-i' 

_ I 
~(x + n+ i)’ 


Example 12. 

Find the value of 

n (n— 1) (n — z) , ,, 

- ^a + 3fe)"+i-.... 

Taking the interval of differencing to be b, the series may be written as 

= (i — £)" fl"+i 

= ( - 1 )" A"a"+*. 


P M A S 


8 
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Now 

jjiH-i = (« + 1) b + terms in a*’’* b”*^-' (r < n). 

A"a"+^ = (« + i) 1 A"a<i> + Ju (n + 1) n ! 

= (n + i) ! (2a + nb), 
and the sum of the series 

= H-6)"('»+0!(2a + »i). 


Example is. 

Sum the series 

je® + (je+ i)=' + (je + 2)*+... +(* + «— i)*. 
The series may be written in the form 
5 =(i+£+f.’*+. 


=A-‘ (* + «)’-A-»r» 

= A-i [(* + n)(i, + 6 {* + «),„+ 6 (* + n),s,] 

= J [(* + «)*“ (x + w-i) 2 — *2 (x_i) 2 ]^ on reduction. 


Note. It is unnecessary to introduce any constant of integration : for 
if we put 

A-ijc® = «5t + c, 

the constant disappears on operation by i?” — i . 


13. “Summation n”, or [n]. 

An interesting example of the development of a series of opera- 
tions by the method of separation of symbols occurs in the theory 
of graduation. One of the objects of graduation is to obtain a 
smooth series of numbers instead of the rough series given by the 
actual data. A step to the solution of the problem consists in 
replacing each term of the series by the arithmetic mean of the 
n successive terms of which the given term is the central term. The 
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“5 


operation of summing these successive terms is generally denoted 
by [«] (“summation n”). For the present we will assume that 
n is odd. 

For example [5] Uo = «-2 + “-1 + “0 + “1 + “a* 

[«] Mo = M n-l-f-M n-3 + .-- + Un-3 + «n-l. 

“22 2 2 

Consider a simple summation: [3] 

By definition [3] “0 = + “o + “n 


and if we write for [3] we may operate again on to obtain 
[3] «o- 

In that event we shall have 


l 3 ]^’o = [ 3 ]«-i + [ 3 ]“o+[ 3 ]“i 

= «_j+ + 3M0+ + «s 

= Uo + (“-1 + «0 + “ l ) + (“-2 + «-l + “0 + «1 + « 2 ) 

= [i]«o + [3]«o+[53 «0- 

If therefore we denote the double operation [3] [3] by [3]* u^,, 
we have the symbolic identity 

l 3 ]*^[i] + [ 3 ] + [ 5 ]- 

Similarly [5]*= [i] + [3] + [5] + [?] + [ 9 ]. 
and [nf = [i] + [3] + [5] + [7] + • ■ • + [2” “ i]. 

where n is odd. 

The identity between [3]- and [i] + [3l + [5] can be seen at 
once by writing down the terms in diagrammatic form ; 

[3]’' “0 = [3] “-1 + [3] «o + [3] «i 



= [ 5 ]“o+[ 3 ]«o+W«s- (Fraser.) 
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These results are general. The sum of n consecutive odd opera- 
tions is [w] [r], where [r] is the middle operator. E.g. 

[5] + [7] + [9] + [”] + [13] = [s] [9] = [9] [5]- 

14 . We can express [w] in terms of the ordinary finite difference 
symbols thus: 

For a simple value of n, say 3, 

[3] “0 = + «o + «i = 3«o + w-i " 2«o + 

= 3 Uo + A-m_i 
= (3 + S2)mo, 

where 8^ is the symbol denoting the second central difference. 

Generally [«] = «H 8* 4 - terms in 8^ and higher differences 

24 

if these exist. (Cf. Ex. 10.) 

The relations above are on the assumption that n is odd. If 
n be even we must find a meaning for the summation symbol. 

By analogy [2] is the sum of two values of u whose suffixes 
are such that their sum is zero and their difference unity. 

I.e. [2]Mo = tt_j + Uj. 

Hence [2? «o = {[i] + [3]} «o 

= (4 + 8*) Mo, 

so that [2] = 2 ( I + i8*)^ 

S2(i+|8*-rk8* + 

which is otherwise obtained by expressing m_j + mj in terms of 
central differences. This agrees with 

[«]=« + "-— 8* 
as far as third differences. 

The meaning of [n] when n is even is now evident, and we need 
no longer restrict the values of n to odd integers. Thus the formula 

[n] Mq = M^n — 1 + M^n — 3 + . .. -|- Mn — 3 + Mn— 1 
2 “2 ■/ “ 2 * 

applies for any integral value of n whether odd or even. 
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•16. The following alternative form for exhibiting relations 
between the operators, whether odd or even, is due to Mr G. J. 
Lidstone : 





-Ull4l“. 


“0 



I(1“.-!j1“^ + [3]“-j + [31“4 +l3l“j 
-(4I [3] “»■ 


The tracks marked show the re-grouping. The diagram 

is of virtually the same form if we begin later; 



16. By means of the relations already proved we may develop an 
unlimited number of formulae involving [n], [m], etc. 
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Example 14 . 

Prove that lo [i] — 3 [3] = 2 [3] — [5] as far as third differences. 

{10 [i ]- 3 [3]}“o=io«o- 3 (3«o + f|S»tto) 

=«o-3SV 

Also {2 [3] - [S]} «o = 2 (3 «o + - ( 5^0 + 

= 1/0-35^. 

Example IB. 

Given [5] u_^, [5] u^, [5] Uj, find u^, fourth and higher differences 
being neglected. 

By Stirling’s formula, 

Am. +A tto * (x®- 1) A^m_, + A*«_, 

M, = M, + x + + ^ ?_ 


„ + x(x*-i)AV,_+A»m_, 

M* + M_* = 2M(, + **A*H_,. 

[5]«O = «-* + «-1 + «O+“i + «2=K + «-*) + K + «-i) + «0 

= 5 Mo + 5 A*«_i. 

Similarly [ 5 ] «-s + [s] «*= >o«o+ *35A*«_i. 

Eliminating A*«_i, we have 

i25«o = 27 [5 ]«o-[ 5]«-«-[5]«6- 

An interesting note by Mr D. C. Fraser on the properties of the 
operator [«] occurs in the Actuarial Students’ Magazine, No. 3 
(Edinburgh, 1930). Here the general form for [«] u, is given by 
Mr Fraser as 


Ei-E-i 


which defines the operation whether n is odd or even; and the proof 
of the identity [3]* = [i] + [3] + [5] is made to depend on Ae development 
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of the operator E. Thus : 

[3] “0 [3] ^ 

_(^-£^) (£-i + £o + £i) 

Ei-E-i “® 

(£* - F-*) + (£* - f^) + (£5 - E-^) 

Ei-lFi “* 

= [i]“o + [ 3 ]“o + [ 5 ] % 

The general formula for [«]* u can be proved in the same manner. 
Many further examples of the use of [n] will be found in various 
papers in the Journal of the Institute of Actuaries. For simple extensions 
of the method see particularly Hardy, J.LA. vol. xxxii, p. 371, and 
Todhunter, J.LA. vol. xxxn, p. 378, and for generalizations, Lidstone, 
y.I.A. vol. LV, p. 177, and Aitken, vol. LX, p. 339. 

EXAMPLES 6 

Sum the following series : 

1. 7, 14, 19, 22, 23, 22, ... to n terms. 

2. 2, 12, 36, 80, 150, 252, ... to n terms. 

3 - * 0 . 9. 7. 4 . o. -S. to 30 terms. 

4. 5, 10, 17, 28, 47, 82, ... to 20 terms. 

5. 10, 23, 60, i6g, 494, ... to t terms. 

6 . 1,2, 4, 8, 17, 40, 104, ... to » terms. 

7. I, o, — I, o, 7, 28, 79, ... to zk terms. 

8. 10, 14, 10, 6, ... to n terms. 

9- *25, 343, 729, 1331, 2197, ... to n terms. 

10. I, o, I, 8, 29, 80, 193, ... to 17 terms. 

Use the methods of finite differences to sum to n terms the series 
whose Jrth terms are p* 

11. (* + 3) (* + 4) (* + 5). . 12. *(* + 2) (* + 4). 

13. (3Jf-2)(3*+i)(3* + 4). ' 14. *(*+!) (* + 3). 
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15. *(a:+i)(x + 3)(* + 4). 16. {2*4-3)(2x + 5)(2x + 7)(2x + 9). 

1 „ I 

(* + 3)(* + 4)' ' * ' (*+i)(* + 3)’ 

(x + 2) (x + 3) (x + 4)' (3JC-2) (3X+1) (3X + 4)’ 

* + 3 

21. —7 r-7 r. 

x(*+i) (* + 2) 

22. Sum to n terms 2. 4. 8. 14 + 4. 6. 10. 16 + 6. 8. 12. 18 + .... 

23. Obtain the general term of the series 

1.4. 10 4-7. 13 7.10.16 
and find the sum to n terms and to infinity. 

n 

24. Find E M*, where Uj = x (x + 2)(x + 5). 

I 

25. The two series 6, 24, 60, 120, ... and o, o, o, 6, 24, 60, 120, ... 
are given. Find the sum of n terms of each of the two series. Compare 
the results and explain the difference. 

26. Sum the series i, 5, 17, 53, 161, ... to n terms. 

27. Evaluate A“^ 

28. Show that A (x!) = x (x!) and hence sum to n terms the series 

1 + 3. 21 + 7. 31+13.4! + 21. 5! + .... 

n 

29. Find E where (3X + 2) (3X + 5) (3X + 8). 

1 

30. Sum to « terms i ■ 3 * + 3 • 5 ^ + 5 . 7 ■ + 7 • 9 '^ + .... 

31. Obtain the formula 

a+ n~ 1 
a 

and use the formula to find the sum of n terms of the series 
-8, -5,0, 14, 44 

n 

32. Prove that E (x*+i).x! = n.(» + i)l. 
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33. Sum to n terms the series whose *th terms are (1) *^(3* + 2), 
(ii) 2* (** + x) by finite integration. 


34. Evaluate S 




1 l(*+i)(* + 2). 


-i c. Sum to infinity — ~ 1- — I — + - --- — + . . .. 

2.6.10 4.8.12 6.10.14 

36. Find the sum of n terms of the series 


i‘‘ + 


3-5-, 7' 


+ .... 


37. Obtain the indefinite integral of (2JC- i) 3®. , 

38. Show that the series 10, 24, 61, 163, 452, 1290, 3759, ... can be 
split up into two other series. Find the two series and hence sum the 
original series to n terms. 

V 39. Prove that = and apply 

this formula to find the sum of the first n terms of the series whose rth 
term is (r+ i) 

._i f (.v+2)^ ) 


40. Evaluate 


(^;(x + 2) (x + 3) (x + 4)i ■ 


41. Find the sum of the squares of the first n natural numbers by 
the method of finite integration. 

15 7 

42. Sum to n terms -'-o"' — o ' +.... 

2.5.8 5.8.11 8.11.14 

. ^ . I . 7 ^ . 7 

43. Find the sum of the infinite senes +-, + , +.... 

T.> 2 2^ 2’ 

44. Find the sum of n terms of the series 

2.2 + 7.4+14.8 + 23.16 + 34.32 + .... 


45. Prove that E +^^-^3 (Att„-x"A«„) 


(i-xy 


(A*«o-x"A*ttJ + .. 


Apply this formula to find the sum of the first n terms of the series 
whose rth term is r (r + i) x'~'^. 


46. Evaluate A~^ 


2*.*. 


(2X + 


I)!. ■ 



IZ2 
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47. Use the method of finite integration to obtain the aum of n terms 
of the series i . 3® + 3 . 5® + 5 . 7* + .... 

48. Find the function whose first difference is <zx® + bx* + cx + d. 

49. Prove that i'’ + 2*’* + 3''*®+ ... is a recurring series, and find its 
scale of relation. 

50. If u„ is the nth term of the series i, 2, 3, 5, 8, 13, . . . in which each 
term after the second is the sum of the two preceding terms, prove by 
the process of mathematical induction or otherwise that 

51. Show how the methods of finite differences can be employed to 
find the sum of a series of the form 

(a + A)' + (a + zA)*' + . . . + (a + xA)'" + . . . . 

52. Sum the series 

I ! A 2! nl 

h-— 1-. H 

a+i (a+i)(a + 2) (a+ i) (a + 2) ... (a + n) 

r— n 

53. Find the value of S 

r-i 

54. Prove that {2 [3] - [s]} «o = “0 ~ 3 ^^-i approximately. 

55. Obtain the approximate formula 

125140 = [5]® + A«,}. (Woolhouse.) 

56. Prove that [n]» - [m]® = {[n] - [m]} {[n] + [m]}. 

57. One of Hardy’s graduation formulae is 

{i 4 o + Ai 4 _,-Aa,} = Ko. 

Prove that this is approximately true. 

58. Express {[ 3 ] + [S]-[ 7 ]} “0 terms of «4_„ Mj, Mj, and 
hence prove that 

— ^ {[3] + [5] — [7]} Ug reproduces Ug to third differences. 

59. If = [5] u„, prove King’s formula: 

«o = -ztoo — •oo8A*if 

fourth and higher differences being neglected. 
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MISCELLANEOUS THEOREMS 

1. In this chapter it is proposed to treat of certain propositions 
and applications of finite difference methods which are not essential 
to a first reading of the subject and which may conveniently be 
dealt with at a later stage. Some of the theorems are developments 
of familiar processes: others are alternative methods of approach 
for the solution of problems involving the principles of finite 
differences. 


DIFFERENCES OF ZEUO 
2. If in the identical relation 

= (A; + n)’"-rt<i) {x + n- + + 

we put x = o, we obtain 

= «”• - « (1) (« - O’" + « (8 ) (n - 2)'" - . . . . 

By continued application of this formula we can obtain values of 
A"*"* when x = o for all integral values of n and m. 

For example, if m = 3 , 

[AV]^=3='-3.2» + 3.i=’ = 6. 

The values of are known as "differences of zero”, and 

in accordance with this definition the expression is often written 
as A**©*". » 

It is evident that a table of values of differences of zero can be 
constructed if we can obtain a relation between corresponding 
values of A"o'". 
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We have from the above 

_ j ^ _ 2 )ni _ 

= n [n™-i -(«-!)(,)(«- 1)”*-! + (« - i)(2) (n - 2)'"-' - . . .] 

= n [(i +n- (i + «-2)’"-‘ 

= nA"-^ I”*-! 

= nA"~^ Eo'"'~^ 

= «A"~^ (i +^) o’"~‘ 

= n (A"“^'"~i + A"o’"-i). 

Alternatively, 

A“o’" = «"*~n(j, (« -i)’" 4 -n( 2 ) (n~z)”' — .... (a) 

But, since 

A"a:™ = (jc + n)"* - n(j, (jc + n - i)”* + rii^) (x + n-2)”'- ... 
we may put n - i for «, m - 1 for m and i for x, and obtain 

_ „m-l -(^n- l)(x) (ft - + (n - l)(2) (« - 2)"*“^ - ... 



= - A"o"*; 
n 

• ^710"! = = n (A"-‘o”'-i + A"o”’-^). 

This proof is given by de Morgan. 

Another method for obtaining this relation, depending upon the 
formula for the nth difference of the compound function u^v^., is 
given below (paragraph ir). 

3. Since A"jc" =n! for interval of differencing unity 

A"o" =«!. 

Similarly, since 

A"+’’*" = o, A"+’'o’‘ = 0 , 
i.e. A”o'” = o, when n>m. 
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We can now build up a table of differences of zero by continued 
application of the relation given in paragraph i above. 

The table is 


m 

Ao™ 

A^o™ 

A'o™ 

A'o™ 

A^o™ 

A'q™ 

I 

1 

0 





2 

I 

2 

0 




3 

I 

6 

6 

0 



4 

1 

14 

36 

24 

0 


5 

I 

30 

150 

240 

120 

0 

6 

1 

62 

540 

1560 

1800 

720 


and so on. 


4. An interesting application of the use of the differences of zero 
for the calculation of the coefficients in an expansion is as follows. 
The fundamental formula 


can be written as 

2) ^ 3) 

= — r — r AX + ■••• 

2! 3 ! 

jC<2) ^ 3 ) 

x’" = o'" + a^i)Ao”' + — A2o'"+ — r A*o’"+ .... 


2! 


3! 


By use of the relation 

A^o”* nA^o”*"^ A"^^o'"“^ 

A'o*" = n (A"©"-! + A"-'o"'-i), i.e. — r- = r — + -7 ^ , 

n! w! (w— 1); 


a table of the coefficients in the expansion of x”* in terms of succes- 
sive values of the factorial can be written down in a similar 
manner to that given above. 


5. The differences of zero have many special properties, and they 
are used in higher work in the theory of series. A simple example 
involving their use is given below. 

Example 1. 

If (i -f A)” = Co + f,A-(- ... n being a positive integer, find the 
value of + , 
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Now (*- 1)* Cl + (* - 3)* c, +(* - 5)» Cg + ... 

= (£^lCl + £^-®CJ + J?-*Cg + ...) ** 

= + E-^T - (i - -E'*)"] ** 

= ^ [(A + 2)" - A"] 

=:i[('^ + 2)"-^”] {*-»)* 

= ^ [(A + 2)" — A"] o* when x = n 
= J [2”o* + B2 "“‘Ao* + ^« (n - I ) 2"~“ A^o* + . . .] 
= J [b2 "“' + n (« — i) 2"“*] 

= 2"“* « (b + 1) on simplification. 


RELATIONS BETWEEN THE OPERATORS 
D AND A 


6. It is shown in Part I (p. 65, paragraph 5) that 

/ (X) + hf\x) + ^, /" (X) + . . . + /(«->' (*) + ... 

converges to the limit f{x + h), provided that Lt R„{x) is zero, 

n -*- 00 

where An 

R^(x)^-finy^x + eh). 


This is Taylor’s theorem for a convergent series, and we may write 
it in the following form : 


^x+rh 


= w, + rhDu^ + 


{rhy 


Dhi^ + 


{j^f 

3 -' 


D^u^ + ... 


= 1 1 + rAD + + 

2! 3! 


...J M, 




But, for interval of differencing h. 


«»+r» = ^"x- 
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SO that AI> = log£=log (i+A) 

* A* A* A^ 
sA + + ... 

234 

, ^ I r . A!* A* A« -| 

and Z)M* = ^j^AM^-Y“x+Y«*-~“*+ -J- 

If A= I, then (H-Ay=e'^ 

and we have a result analogous to the important relation in the theory 
of Compound Interest, namely (r+/)” = e”®, where S is the force of 
interest corresponding to a rate of interest i. 


7. Since 




A* A* A« 

1 h ■ 

234 


■ A* A3 A« 

A + 

234 


A* T 

+ ... 

4 J 


= ,-[A3-A3+i^A^-|A3 + ...]. 


Similarly, = ^- [A* - |-A< + JA® - . . . 

We have therefore a convenient method for expressing the 
differential coefficients of a function of x in terms of the differences 
of the function. 


Example 2 . 

/i, (the force of mortality)= — * where is the number of 

persons at exact age a; in any year of time. Given the following table, 
find a value for fifo- 

Age* ... 50 51 52 53 

73.499 72,724 71.753 70,599 


The difference table is 


X 

lx 

A/. 

A*/. 

50 

73.499 

- 775 

— 196 

51 

72,724 

- 971 

-183 

52 

71.753 

-HS 4 


S 3 

70,599 
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dx 


A2 A3. 


= A + 


L 


2 3 

since the interval of differencing is unity 

= -775 + 98+4-333---. when * = 50, 
= - 672-667. 


Mso — 


Ldx 


_ 672-667 

*-50 73>499 


= -00915. 


Note. If /i=|= 1, care must be taken to divide by the appropriate powder 
of h in applying the formula. Thus, if is given for quinquennial 

I 

intervals of x, will be ^3 (A^w^ — + ...) and not simply 


5' 

A3tt,-|A*«^+... 


8. The result 


dx 


h 2 


A3 

H W- — 

3 


■] 


can easily be obtained by the differentiation of the advancing 
difference formula for u^. 

Taking the simple case when /t= i. 


= «o + “ ^*“0 + • • • 


or 




(±1) . 


which becomes, on changing the origin to x. 



A^m^ A^Mj. 

2 3 


Similarly, we can express in terms of central differences. 

For example, Stirling’s formula is 

Awn + Au, jc(x:3 - i)A*m_i + A3m „ 

U^ = Uo + X—^ i + -, A2w_i + -A- 3 —2 + .... 

* 2 2 ! 3 ! 2 
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Differentiating with respect to x: 

dx 2 ^*6 2 

Amo + Am_i If., A, 1 

\dxj^ 2 12 ^ ^ 

as far as third differences 

= - ~ ^ “ 3“i + 3“o - «-i + «i - 3“o + 3«-i - “-J 

= ; (wi - «_j) - («2 - tt_2) on simplifying. 

3 

Changing the origin, we have 

2 , A t / \ 

12 *^*‘''* 

A first approximation will evidently be 

du^ Ux-n-»x-i 
dx 2 

or, if the unit of differencing be h, 

^x4-ft ^x—h 

2 h 

This simple approximation can be seen quite easily from a con- 
sideration of the geometry of the figure. 
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Let P be the point whose coordinates are (jc, u^.) and let P,, P, be 
the two points whose coordinates are {x + h, and {x-h, 
respectively. 

Then if PK be the tangent at P 
du- 

- 3 — =tan a 

dx 

.^nearly 

PyM^-LMi 

M^My 

^. r+A ^x—h 

zh 

9. Another formula, giving the differential coefficient in terms of 
central differences, can be obtained from Bessel’s formula. 


a. . - i) Aa. + 




Changing a; to a; + 




Differentiating : 
d . 
dx'^^^ ^ 

If A[: = 0, 




. +A*«o ix- J \ ,, 

u,^ = A«o + a: ^ + + 

d A®u_i 


Changing the origin to * — ^ we have the approximation 

d . A»K^_j 

dx^^ ^“x-i 24 ■'■•••• 

An interesting discussion on the calculation of the values of differential 
coefficients of a function by means of selected values of the variable will 
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b€ found in T.F.A. vol. ix, pp. 238 et seq. (G. J. Lidstone). By means of 
tables of coefficients prepared for convenience of practical working 
Mr Lidstone evolves formulae for the values of the successive differential 
coefficients, both for advancing differences and central differences. 
In addition, alternative processes are given for the values of the deri- 
vatives when the intervals are unequal. 


*10. It should be borne in mind that, in choosing a formula for 
the expression of the differential coefficient of a function in terms 
of its differences, central difference formulae are to be preferred 
to advancing differences. These formulae possess smaller coeffi- 
cients and have the additional advantage that alternate coefficients 
vanish. Further, a greater degree of accuracy may be obtained by 
the use of a suitable central difference formula. 

Mr G. J. Lidstone has pointed out that if for any reason it is 
decided to use the relation 


considerable increase of accuracy can be secured by the simple 
and easily remembered device of ending with ^ instead of 

^ A"m^. We bring in for the purpose of obtaining A"tt^_i, omit 

“x+ti4-i otherwise use the formula as before, beginning with 
Au^. This reverses the sign of the error and reduces it numerically 

in the ratio 1 : (n -t- 1). Thus, - A" A’’^^ is replaced by 

n -t- 1 ' 


^ A"-- A*^!. 
n n 

It is assumed that A’*+h<, though not negligible, is not varying 
greatly. Fhe proportionate improvement increases with n, and if 
w is at all considerable, we attain approximately the accuracy of a 
further order of differences with no additional work. 

The same process may be employed for the second and third 
differential coefficients, after which the improvement, though 
present, is not proportionately so marked. 


9-a 
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THE COMPOUND FUNCTION 

11. We can adapt the principle of separation of symbols to the 
evaluation of such expressions as by a simple extension of 

the process of differencing. 

Let Aj, Ej, Ej, ... denote operations on alone and A^, Ej, ... 
operations on alone. 

Then = 

— E^u^. E^v^ — 

= (E^E^~\)u^v^. 

AX^’x = (^1^2 - 1 )" “x^'x- 

By expressing E^E^ in terms of A^ and Ag we are enabled to 
obtain expressions for the expansion of 
First, if n= I, we have 

A«x^’x = [{ I + A,) (I + Ag) - i] 

= (A, + Ag+A,A2)t/^t’, 

= (A, + A2Ei)«*t>^ 

= AiUg,t;^+AgEjWg,t)^ 

= t;^Aia^+£’jK^.Agt», 

= ^xAi«,+«x+iAgV^, 

or, dropping the suffixes, 

= v^Au^-\rU^^^Av„ 

which is otherwise evident. 

Again, 

AX^x = (Ai + Ag)" 

= (Ai" + n(i)Ai’‘-J EiAg + nigjA,"-* Ei*Ag® + • ■ •) “x^x. 

which is easily seen to be 

AX + ”U) At^* A"-X+i + "(aiA*^* A"-X+2 + • • • + «x+n AX- 
If in the above expression we put u^=x'^^ and v^ = x, we have 
A® . x) = xA"x”*“^ + bA"^* (jc + 1 )”'~K 
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Let *=o; then 

^"0”* = jm-i _ £b”*~ = n (A"~^o’"“^ + A^o*""^), 

which is the relation proved in paragraph 2. 


1 2, The ordinary formula for finite integration by parts is 
'Lu^Vx = u^'Lv^-'L [Auj.S®^^j] + an arbitrary constant. 

In cases where the formula has to be applied more than once 
we may use the series found below. 

SsA-i^ (Ai + A2 + A,A,)-i 
s[A, + Ai(i+A,)]-i 
sAr‘ [i+A,Ar* (i+A,)]-i 

= Ar' [i-\ Aj-i ( I + A^) + A.^A^-® (I + A^)^ - . . .] 

sAr^-4Ar^(t + A,)+VAr^(r + A2)^-... 
s Aj-i - A, A2-2 £'j + - . . . . 

On dropping the suffixes and inserting the appropriate functions, 
we have 

= u, Swj. - Aa^ + AX - • • ■ • (a) 

By an alternative treatment another formula is obtained : 

SWx^i = «x-oS^x-AM;r-2S^'a:+ AX-S-^^’i- ■■■. (*) 

As an example of the use of a series for ^u^Vj. we may prove the 
formula for given in Chap. VI, paragraph 10 . 


Example 8. 

Prove that 




( aA a»A- 

Ji + 


a — 1 ( a — 1 (a~ ly (a — i ) 


a*A’ ) 

.•Am, 




0* 


a- 1 (a-i)* 

and so on. 

Let a* = t), in series (a). 


17 X 4*8 




■7 «> S^a*+^=;r 

(a-i)» (a-i)^’ 
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Then 


"Lu^* — Ux Sa* — Amj S®a*+® — 2*a*+® + . . . 


„x+l 


a *+2 


rtX4-3 


= ^ r«^3- ^ 4 + - 


_ a® j 1 1 

a— 1 1* a— I (a— 1 )* (a— i)®^'") ' 


1 3. The following is a further illustration of the application of the 
above method. 


Example 4. 

Prove that 

£» (UxVx) = Ux y,”Vx-n^^^Aux + (« + 

- (n + 2 ), 3) A^Ux + . . .. 

Now Aa^ «;* = [Aj + A, ( I + A^)] Ux v,. 

S" = A-" = [Aa + Aj ( t + Aj)]-" = A*-" ( i + A, A^-^E^)-". 

:. I"«^t’^ = Aj-'‘ [i -«,„A,Aa-‘£’a + (« + i)(3) Aj'Aa'^Ej® 

-(« + 2),3) A,®Aj'®£'2»+...] UxVx 

= Ux l^^Vx - «(1) Am* 2"+'f*4i + (« + 1)(8, A*m* 2:'‘+2r*+3 

-(m+2),3, a^m*i:"+% »+3"^ .... 

If If =: I we have 


= (i +AiA2“^£2)”^ u ^ v , 
AjA2 ^^2 


=^■-‘1-7 


+ AiA,-1£'3 




= (v‘- 

= (Ar^- 


Aj A3 ^£'a N 
Aj+Aj^a/ 

AjA£^^^\ 

J 




= Ux ^Vx - A-» (Am* I.Vx^i) 

= «* 2^«'*-2(AM*St)*+i), 


the ordinary formula for summation by parts. 

The subject of finite integration by parts is treated more fully in 
Chapter VIII. 
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FUNCTIONS OF TWO VARIABLES 

14. When x and y are independent variables, u^y, f(x,y), ... 
represent functions which assume different values according to 
the values of x and 3;. An alternative notation is Mj..„ or f{x:y). 
For example, the function 

x^ + zxy + x’ + 3jy, 

in which x and y both vary, may be written shortly as either Uj^y 
or u^.y. If j is a function of x, we may reduce to the form 
and thus obtain a function depending on x alone. 

Now suppose that x is changed to a: + A and that y is changed 
lo y + k while x remains constant. Then the new value of the 
function is dependent on x-\-h and y-^k. It is not necessary for 
both jcand y to vary: x may become x + h while y remains constant 
or vice versa. 

If the values of the function proceed by equidistant inteiv^als, 
we have the following scheme; 


^x.y 


^x+2lt:y 

^x+3h-.v 



^x+2h.v+k 

^x'tSh.y-irk 

^x:vh2k 


^x+2h:v+2k 


^x:v+Sk 

^x4A:i/+3A 

**x+2A:v+3* 

“*+3A:i/+3* 

or, if our ' 

origin be ( 0 , 0 ) and h =k = 1 , 


^l;0 

“1:0 


“3:0 

“0:1 

“i.i 

«8;l 

“8.1 

*^0;a 

“i:a 

“2:2 

“3:2 


1 5. If we are to apply the processes of finite differences as hitherto 
defined we must distinguish between an increase in the value of x 
and an increase in the value of y. We therefore write to denote 
the operation of increasing the value of :if by a unit difference while 
y remains constant, and Ey similarly for y while x remains constant. 
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That is, £'*Mo.o = “i:o and £yUo:o=«o:i. 
so that AjMo.O=«i:o-Mo; 0 and A„Mo:0 = «0:1-“o:0- 

Again, A„«o:o = (Av«o:o) 

= Aj, («0;l — W(J.o) 

= AjUo:i — A_,.Mo.q 

= “l;l-“o:l-«l:0 + «0:0. 
and Aj,*A„Uo:o = A^,* («oa - «o:o) 

~ Ajj^Uq:! Ax^O.O 

= («j.i - 2«i.i + Mo-i) - (ttj.o - 2«,.o + Uo:o). 

The general formula corresponding to the advancing difference 
formula for one independent variable is 

“m:n = (l+A^)’"(l+A„)'‘Uo:0 

= (i + mti)Aj. + m(2)Aj’^+ ...) (i + n(,)Ay + n(j)Ay*+ ...) u#;,, 

= (i + m(i) Aj + wj(2) A/ + m(3)A^s + . . . 

+ «(i,A„ + m(i)«(j)A*Ay + OT(2)W(i)A^*A^ + .,. 

+ n(2 ) A^® + OT(i,«( 2,A, Ay® + . . . 

+ «( 3 )Ay®+..)« 0:0 

= «0:0+('«(l)Ax + «a)A;,) Mo;0 

+ (w»(2) A*® + A^ Ay + 71(2) Ay®) 

+ ('«(s)A;r^ + m(2)n(i)A/Ay + m(i)«(2,A,Ay® + n(s)Ay») 

It often happens that a certain order of differences, say the rth, 
is sufficient for interpolation along the jc-line, while a higher order 
is necessary for the j-line. This is especially so when ac is a young 
age and y an older age (see p. 137, Ex. 5). In this case, the formula 
can be simplified by omitting all terms involving A/‘*^w whether 
standing alone or combined with values of Aytt. For the same 
reason it may be desirable in tabulating to use smaller intervals 
for y than for x, according to the nature of the function. 
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16. Application of the formula. 

Example 5. 

Given the following table of values of estimate the value of 

«SS;i 7 - 


X 

3'=I5 

y = 20 

y = 2S 

20 

5-947 

4-418 

3-547 

25 

6-046 

4-530 


30 

6-144 




Here the interval of differencing is 5. Changing the origin to the 
point (o, o) and the unit to i, the data are given for the points (o, o), 
(1,0), (2, o); (o, i), (o, 2) ; (i, 1). The value required is u.r, >4. 

Differencing downwards for values of Aj-Ug g, etc., we have 

A* Mo : 0 = -099 : .• 0 = - -ooi . 

Differencing across for values of A„Mo:o, etc., 

A„Mo;o= -1-529: A„* 2 /o: 0=-6 s 8. 

Also A^A,Moo = «i;i-«o:i-»ff:o + «o:o=-OI3- 

«.4:.4 = (l + -6A^--I2Aj,* ...)(i + - 4 A„-T 2A„2 ...) Ug:g 

= (i + •6A, + •4A, - -r2A**+ •24 Aj.A„ - ■i2Ag-) Mo:o 
= 5 - 3 i 9 - 

No/e. Here it is not really necessary to use A,*Mo:o: see end of 
paragraph 15. 

17 . While for most purposes the formula given above is con- 
venient, special circumstances may arise in which other methods 
may be more suitable. Where the interv-als are not equidistant we 
may apply either a method of divided differences or one of various 
adaptations of Lagrange’s formula depending upon the number 
of points given. If, for example, four values of are given, 
namely u^.g; u^.g, Up-^; then it is quite easy to show that 

„ (^ -^) (:>'-^) I „ (v-.g) j y-a) 

“x:v (a - /3) (a - 6) * *’ (x -p){b- a) 

(x-g) {y-a) ( x-g) (y-b) 

+ ^-p-g)(u-6)- 

If more than four values are given the formula becomes un- 
wieldy. It is seldom necessary to interpolate except between 
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equidistant values of the function, and in that event a form of 
advancing or central difference series is generally preferable. 

Two-variable functi ons are of gre at frequency in actuarial work. 
T ables of annuity-values (g^.y) dependin g upon joint lives are often 
av^lable for quinquennial values ol x and y, and when values 
a t ages other than IHose tabulated are regiiireri remnr^ must hp 
h ad to methods of interpolation . Although the formulae given 
above are of general application special methods can be found to 
meet the requirements of the problem to be solved. 

For example, if quinquennial values of a^.y are available, and if 
the two ages concerned are such that their sum is a multiple of 5, 
we may choo.se our origin and interval of differencing so that 
x+y~ I. We have then, from the general formula for y, 

“xu-x = “0:0 + [x^x + ( 1 - ^) (x - I ) ( - 2A^ + Aj,2)] ; 

i.e. 

“x-.l-I = «0-.l + •* (“l-.O - “O'.l) + i* (jf - l) («j.o - Ziq-i -I- Wo;i). 

Again, if *-(-^=2, this formula becomes 
‘^x:2-x = hx (X- l) (x-l)(x-2) ttj.j, 

or, on changing the origin, 

“x:-x = H*- 0(^-2) «o: 0 -*(^- 2 ) + {x-l)u^,__^, 

for which the data required are «o;o, Mk-i and The problem 
is thus reduced to a single variable interpolation. 

This second formula is very useful in practice. As a rule we can 
choose our data within wide limits, and it has been found that 
with certain functions the three-term formula gives as good 
approximations to the true results as do formulae involving higher 
orders of differences (see Spencer, vol. XL, pp. 293-301). 

The general second difference formula of which the above is a 
particular example is 

^X’.rx P^Zitr^ 

and in the note referred to above, Spencer gives a table showing 
the application of this formula according as r takes the values 
o, I , — I or 2. 
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Another form of the formula for an interpolated value of Mj,;, when 
four values are given is v = ^ (’?“o o+J“o i) + -*^ (W o+J^i 1). where 
X and y are both less than unity and x + i—y + rj = i. The second 
difference formula can be written as 

«*:v = [i-('^iV + A,V)][^(’?«o.o+>’«o o+yui 1)], 

where 8^,^ and are second central differences with respect to x and 
y respectively and and /ij are factors depending upon the values of 
X andjy (Buchanan, T.F.A. vol. x, pp. 329, 330). 

Example 6. 

Values of the joint-life annuity „ for quinquennial ages being avail- 
able, find a value for jj. 

(1) Take the origin at (40:50); then if the interval of differencmg 
be s years, (44:51) wall be represented bj (-8. 2) and x-l-y=:i. 

i-i = «o i + *("i o-“o + (": o-2«i i + “o 2)- 

The data required are 

^*40 64= = fl«60=tO’7^3 

<*60‘6a= ^°'202 ^46 06= 9'8S4 

Then 10 135 + -8 (10 763 - lo 135) 

-t- 1 -8 ( - -2) ( 1 o 202 - 1 9708 -(- 8-926) 

= 10-135 + -5024 + -0464 
= 10-684. 

(ii) Take the origin at (40:45) so that (44:51) will be (-8:1-2) and 
x+y = 2. 

«*-2-x = i (*- i) (*-2) «o;a-ac (x-2) + (*- i) Uj.,. 

The three values required are 

'^10-66 = to ^35» 
a45. 60 =10763. 

<250:46 = 10-763. 

«-s i-2 = i (-•2)(-i-2) io-i35--8(-i 2) 10 763 + J (-8) (--2) 10-763 
= 10-688. 

If nine values surrounding the point (44.-51) be taken and a Lagrange 
formula for these nine values be used, the value for 5, becomes 10-684. 

This nine-point formula is a safe formula for occasional interpolation, 
and by its use the risk and labour attaching to the calculation of differ- 
ences may be avoided. The formula is used centrally, the area of mter- 
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polation being as shown in the diagram below. The ordinary single- 
variable Lagrange interpolation formula is used to interpolate for x in 
each column, and the formula is used again to interpolate fory from the 
three calculated values. 



This process is very simple with the aid of a machine, especially for 
quinquennial intervals, when the coefficients are as follows : 


Interval 


«„ 

“1 

-•4 

— M 2 

+ *84 

+ •28 

— •2 

— *08 

+ •96 

+ -12 

+ -2 

-f-M 2 

+ •96 

-•08 

+ ■4 

+ *28 

4 - ’84 

— 12 


18. The above example shows that different degrees of accuracy 
may be obtained by choosing different sets of data on which to 
work. The general theory follows the same lines as that for single- 
variable interpolation. It will be remembered that the ordinary 
advancing difference formula may be applied to the expansion of 
Uj, in terms of the differences of on the assumption that y = Uj. 
is a rational integral function of x. In these circumstances we may 
represent the function graphically, and the successive values of x 
and y will be points on the plane curve y = u^. When we are 
considering a function of two variables x and y we assume similarly 
that we may represent z = u^.y as a surface. Now in Chapter V 
(paragraph 15 ) it was proved that the effect of including higher 
differences in the expansion for does not necessarily give better 
results than if they are neglected. In the same way it may be 
shown that by choosing more points on which to work we may 
exceptionally produce a result farther from the true value z on the 
surface z = u^.y than we should obtain by relying on fewer data. 

With regular data the formulae with x, y in the central area of the 
given points are usually preferable. In the space for which x, y are 
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both positive and less than which includes all cases if the origin and 
direction of the axes are suitably chosen, a simple central difference 

formula is , 1 / x 1 / x 

«x:» = «o:o + i* + 

This is based on five points. 

This formula and the six-point formula consisting of the same terms 
with the addition of 

i** (“-1 : 0 - 2 “o : 0 + “1 ; 0) + : 1 - 2«0 : 0 + «0 :-l) 

are probably the most useful interpolation formulae for ordinary 
actuarial purposes (Todhunter, J.I.A. vol. Liii, p. 89). In some cases 
either the first or the second additional terms may be omitted (see end 
of paragraph 15). 


• 19 . Taylor’s theorem for a function of a single variable, 

f(x+h)=f(x)+hf'(x)+^^r ix)+..., 

may be written symbolically as 

/ {x + h) = f (x) (see paragraph 6 above). 

d ? 

Denoting as usual partial differentiation t>y we have 

h y 

f{x + h,y + k) = e ^^/{x,y + k) 

V (*.>-) 


h~ k 
= e e 

— f?x dv 


This formula is of theoretical interest. For use with tables the 
partial differential coefficients must be replaced by differences, so 
that in effect the formula repeats the difference formula already 
given. 

Thus, for quinquennial intervals, 

* * 

(i+A^)»(i-|-A„)* = e 
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h k 0 9 

so that -log(i + AJ + -log(i-)-A^)sA^+A:^; 


[* V+ ■)+* (^.-^^ ■)] 
+ * I-)* " 4 A, A, + A^A/] + . . . . 


INTERPOLATION FORMULAE: FRASER’S 
HEXAGON DIAGRAMS 

20. No demonstration of interpolation formulae would be com- 
plete without reference to Fraser’s graphic method. In this 
method the ordinary differences of a function of x are combined 
with the relation (a;-|- i)(,) = JC(r) + a:(,_i) in diagrammatic form so 
that by adopting certain conventions any finite difference formula 
can be written down immediately (Fraser, jf.l.A. vol. XLiii, pp. 
235 et seq.). 

We have (•« + <+ i)(,) = (*+0(r) + (* + 0(r-i). 

or (a: + /+ l)(r)~(^ + 0(r) = (^ + 0(r-l)- 

A relation similar to the fundamental finite difference identity 
— Uj. = Au^ exists therefore between these coefficients. If we 
carry the analogy still further we can construct a table of values of 
(x -(-<)(,) corresponding to a difference table. 

The tables are set down in reverse order thus: 





(* + 3 ) 0 ) 


A«_2 


(* + 0(2) 


A'‘*«_2 

(*+0(1) 

(* + 2)(8) 


A«_i 

A^_2 *(o) 

(*+ 0(2) 

“0 

A*«_, 

AXj X(i, 

(*+ l)(3) 


Auo 

AXi (*-0(o) 

*(2) 


AX 

(*-0a) 

*(3) 


Aui 


(*-0(2) 

«2 



(*-0(3) 
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H3 

If now these two tables be combined, we have the following 
scheme, where, by convention, (a: — i )(o) . . . = i : 



where for any one of the hexagons w'e may write in general 

(x + I - I )iri 

(* + f - I 

Now 

(ar + / - 1 )(,, A’-u_, + (v + > A-" - (v -f t - i 

= (x + 1 - I )(,, [A'«_< - A’-tt_,+ J 4- A’-+‘«_, [(jc + 

-(x + r-l)(r+,,] 

= (x + t - l)(r) [ - A'+1«_J + A''+‘u_, (x 4- 1 - l)(,, 

-= 0 . 

A relation is therefore established between the constituents of 
the various hexagons. If we make the following assumptions: 
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(i) the oblique lines denote multiplication and the horizontal lines 
addition ; (ii) a line taken in a right-hand direction gives the product 
a positive sign, and in the opposite direction a negative sign, we 
can say that the sum of the operations performed in travelling round 
any hexagon is zero. It follows easily that the sum of the operation* 
in travelling round any closed circuit is also zero. 

It is evident from a consideration of the diagram that if we travel 
from any value of (ie-l-t)(*) to any difference the result will 
be the same whatever route be taken. For example, from (x— i)(o) 
through Mo, X(i), Aug, X(2) to A%o and (by completing the hexagons) 
back along (x-i)(2,, Au^, (x-2)(i), (x- 2 )(o), to (x-i)(o) 

again enables the following identity to be established: 

(a;- i)(o)“o + *a) Amo + X( 2 ) AX-(a;- 1 )( 2 ) A*Mo- (*- 2 )a) Aui 
- (x - 2)(o)M2-|- (X - 2 )(o)Mi - (x - l)(o,Mi = O. 

Re-writing this, we have 
Mo + xAuo+ia: (x-i) A^Mo-i(x-i) (x-2) A^ 
or -(x-2) Ahi-M2 + Mi-Mi = o, 

Mq + xAMo+^X (x- i) A*Mo = M2 + (x - 2) AMj-t-i (^“ 2 ) (*- l) A®Mo. 

Exactly the same result will be obtained by proceeding along an 
alternative route 

(x— i)(o)i ^0) ^(i)> Amo, ^(2)> A*Mo 
and back through 

(x- i)(a), Ami, (x- i)(i). Ml, (x- i)(o). 

The identity will be 

(x-l)(o)Mo 4 -X(i)AMo+X( 2 )Ai 2 Mo-(x-l)( 2 )A 2 Mo-(x- i)(i)AMi 

or Mo-f-xAMo-t-^x (x- i) A^o=i (*-2) AX 

-l-(x- 1) Amj + Mj; 

i.e. =i (x- i) (x-2) AX+(*- 2 ) Ami-(-Mj, 

the same result as before. 

21. Applicaticm of the hexagon diagram. 

The above example gives a formula for Mj in terms of Mo , Amo , AX 
and Ami, and if we put x = 2 we have a well-known identity. A 
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similar process will give a formula for u„, and since we may take 
various routes a number of different expansions of will arise, all 
giving exact expressions for It should be further observed 
that when an nth difference has been reached by travelling along 
the upper route the terms other than in the lower route will be 
Zero, and it follows that by travelling round any circuit we obtain 
expressions involving an initial term and terms of lower degree 
than n. This is seen to be so by considering all the coefficients 
along the lower route will contain (jc — w) as a factor and will 
therefore vanish when x = n. 

We have therefore from the diagram the following expansions: 

(i) Wn = Wo + ”<i)'^“o + ”(2)^^o + ”(3)^®Wo+ ••• (Newton’s formula). 

(ii) u„ = «o+na)^“o + «<2)A“«_i + (M+i)( 3 )AVi 

+ (m+ i)(4)A^u_ 2+ ... (Gauss’s forward formula). 

(iii) «„ = Uo + « (1 ) Au_i + (w + I ) (2) A*«_i + (n + i ) (3 > A^u.j + . . . 

(Gauss’s backward formula). 

(iv) = Uj + (« - i)(i) Auo+ n(2)AX+ M(3) 

+ (n + i)(4)A*M_g+ ... , 

The mean of (ii) and (iii) gives Stirling’s formula, and the mean of 
(iii) and (iv)can be arranged to give either Bessel’s or Everett’s form. 

22. Mr Fraser has shown that the hexagon diagram can be used 
for divided differences in a similar manner to that given above for 
differences at equal intervals. (D. C. Fraser, Newtor and Inter- 
polation — a Memorial volume issued by the Mathematical Associa- 
tion, 1927.) 

A typical hexagon involving divided differences is 
Aua <x-a) (x- i) 

(x-b)( 

w 

(x-i)(,x-e) 


PM A S 


to 
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and the construction of the system follows closely the lines of that 
for ordinary differences. 

The A diagram is a special case of the A diagram, since ordinary 
differences are themselves divided differences when the given 
values correspond to successive integral values of the argument. 
The results in the preceding paragraph could therefore be obtained 
by a consideration of the hexagon diagram for A , the modifications 
necessary to produce the simpler forms of the central difference 
formulae being introduced at the last stage. 

OTHER SYMBOLS OF OPERATION 

*23. It has already been shown when considering the common 
operations of finite differences, Au, Eu, Su, that the symbols 
denoting the operations can, within limits, be treated as obeying 
the ordinary algebraic laws. By omitting the function u the various 
processes can be applied to the operators alone, with a resultant 
simplification of procedure. For example, the method can be 
adapted to the needs of the infinitesimal calculus, and Dr Aitken’s 
introduction of his 6 operator has produced an interpolation 
formula of extreme generality (see Chap. VIII). Other operators 
have been devised for special purposes connected with finite 
differences, and an interesting example is given below. 

*24. The operator V. 

In Chapter IV attention was drawn to certain symbols of opera- 
tion which may be considered as supplementary to the A and E 
which are the basic operators in finite differences. These symbols 
—namely S and ^ — may also be assumed to follow the normal 
algebraic laws (with the usual limitations), and the method of 
separation of symbols may be applied to them equally with A, E 
and S. A further symbol has been introduced connecting with 
the next lower value Instead of with the more usual value 
This symbol is V, and Vk^ is defined as Uj. — «j_i. 

Corresponding to Au,. = (E — i ) u,. , 

we have therefore Vuj. = (i —E~^) . 
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Thus, for example, 

= W* - + «( 2)«^-2 - ”(a )«*-3 + • • • • 

In addition to the familiar 
Ajd”*) 

there is a similar relation 

y^-m) 

where =( — i)”* ( — xy™* and not the inverse factorial defined 
on p. 19 ; and if we denote the product 

X (x+ Jn — 1) (x+ — 2) ... (x- ^n + i) 

by x^"' it is easy to show that 

(Steffensen, Interpolation, pp. 8, 9). 

No new principle is involved in dealing with these further 
symbols of operation; their introduction simply enables us to 
develop expansions and to write down formulae for interpolation 
with a minimum of labour. 


OSCULATORY INTERPOLATION 


•25. It may happen that we know the values of at intervals of a 
unit, and that we wish to calculate a complete table of values with 
smaller intervals. For example it is a common practice to calculate 
every fifth value in a life-table, and to complete the table by 
interpolation : here the unit interval for the preliminary calculations 
is five years. 

If we decide to use a third difference formula then every inter- 
polation involves four of the given values. For the interval o to i 
the best course is to base the formula on the four values Mj,, 
Uy, U2, thus giving equal weight to values on either side of the 
interval. An appropriate formula is Bessel’s formula, namely, 




X (x- I ) A’Hip -f- A*«_i 

2! 2 




xo-a 
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This is the formula for the interval o to i . For the interval i to 2 
we should use the corresponding formula based on the four given 
values Mq, Wj, Kj, M3. 

Thus for the two intervals o to i and i to 2, the two interpolation 
curves have a common ordinate when jc = i ; they may not 
necessarily have a common tangent. Two neighbouring interpola- 
tion curves will usually cut one another at the point of junction, 
and while there will be a smooth run of values within each unit 
interval the values will not run smoothly with those in the next 
interval. 

We are led therefore to enquire whether we can find a series of 
curv'es of interpolation which shall have common tangents as well 
as common ordinates at the points of junction. Such curves are 
said to have contact of the first order at the points where they join, 
and the necessary condition for this contact is evidently that 

and must have the same values at these points on the one curve 

as on the other. 

The interpolation curve given by Bessel’s formula (i) for the 
interval o to i was based on the conditions that it should have 
“-!) “0. “1. “2 ordinates. We retain the condition that and m, 
should be ordinates and abandon the condition that m_i and 
should be ordinates. Instead, we shall stipulate that the differ- 
ential coefficients of when x = o and when ;)c=i have known 
values. To fix these values we shall proceed as follows; 

The interpolation curves for the two intervals — i to o and o to i 
are to have the same tangent when x = o as the curve of second 
degree which has u_i , Uq and Mj for ordinates. ’I’he equation to this 
curve may be written 

^*(-1.0. = i (“o + “i) + (*-i) Amo-|-|x ...(ii) 

The interpolation curves for the two intervals o to i and i to 2 
are to have the same tangent when x = i as the curve of the second 
degree which has Uj , m^ and Uj for ordinates. In a similar manner 
to the above the equation to this curve may be written 

Fj,(o, i,2) = i(uo-j-Mi) + (x-i)Auo-|-i*(x-i)AV •••(iu) 
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It will be noticed that the forms which we have chosen for the 
equations (ii) and (iii) differ only in the last term. 

Differentiating (ii), 

i) = A.Uo+{x-l) 

so that Ff)' ( — 1,0, I ) = Auo — (iv) 

Differentiating (iii), 

F^' (o, I, 2 )=Awo+(*-i) AX. 

and Fj' (o, I, 2 ) = Auo+iAX- (v) 

The values of Fq' (—1,0, i) and F,' (o, i, 2) in (iv) and (v) are 
to be values of the differential coefficients of the required inter- 
polation curve for the interval o to i. 

Let be that curve, so that its ordinates when * = — i, o, 1,2 
form the basis for an ordinary^ interpolation formula of the third 
degree : 

* (ac — I ) AX + 


+ ^ 1 ) + (-^ - i ) <^^'0 + ■ 


2! 2 

(jc-|)x(a;-i) 

3! 


AXi- 


.(vi) 


The tangents to this curve are given by the equation 


<=Az)o + (a:-^) 




+ - 




..(vii) 


The conditions to be satisfied are: 


■* 0 ’ 


.(viii) 


''o'=-^o' (-L o. 

^1' = (o. 1.2) = A«o+ iAX- 

We have at once i (^'o + ^i) = i (“o + “i). 2nd Ar>o=Auo; these 
determine the first and second terms in (vi). 

IT / -x ' \ iA%„q-AXi I 

From (vn), Vg = At^o — H A®r/_, , 

2 2 12 

2 2 12 ^ 


2 
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and therefore + fj' = 2 Aoq + , 

Vi-Vo' = i (A*»o + 

But, from (viii), 

t'o' + Wi' = 2 A«o + i ASm_i , 


W - ®o' = i 

Comparing the two expressions for + ^/ we have at once 

A®o_i = 3A®u_j (since At;o = Auo), 

and from the two expressions for r>j' — u/ 

i (A% + A*t;_i) = J (AX + A*«_i)- 

We have now found the values of all four terms of the formula 
(vi), and we can write the formula in terms of m’s as follows: 

X (x — I ) A^Wy + A^_j 


«'i = i(“o + “i) + (^-i) ^“o+- 


+ 


2 2 

x(x-i)(x-^) 


A®«_i. 


.(ix) 


This result is a formula of osculatory interpolation, and differs 
from the ordinary central difference formula (i) only in the last 
term. 

The difference is 


x(x-l)(x-i) X (x- 1) ( 2 X- l) . - 

v^-u^ = A*m_j = — ^ A ’ A»M_1 . 


•26. The problem of osculatory interpolation has been a fruitful 
source of discussion by eminent actuarial authorities. The method 
was devised by Dr Sprague (see J.I.A. vol. xxii, p. 270) and was 
subsequently developed by Prof. Karup, Mr George King and 
Dr Buchanan. An elementary demonstration of the method, de- 
pending upon advancing differences, is given by Mr King in the 
Supplement to the 75^/1 Annual Report of the Registrar-General. 

The following modification of Sprague’s method, using King’s 
approach, has been suggested by Mr P. G. Neal. 
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Consider the four values of u_^, Ug, Uj and u^. 

The second degree curve through u_j, u^, and is 

M* = M_1 + ( I + x) ^ A*M_i 

. x(x- i) 

= 1/0 + xA«o + — A-m_j : 

. ■ . «*' = AU(, + xA*a_j — I A‘‘‘u_i 

so that [m^%_o=Auo-^A2m_]. ( 1 ) 

The second degree curve through Uq , w, and is 

. X (x-i) .. 
u^ = Uo + xAmo+ — Ahio; 

«/ = Auo+xA^^ifl-iA^ 

K']x-o = Auo+iA*u„. ( 2 ) 

Let the third degree curve through u_i, «*, Uj and Uj be 
«^ = Ufl + ox + 6x* + cx®. 

Then u^.' = a + zbx + ^cx^, 


a + 6 + c = Wj-u„ = Aup. (3) 

From (i) <2 = Aiip— ^A^u_j. (4) 

From (2) a4-26 + 3c = A«p+ JAV (5) 

Whence, solving equations (3), (4) and (5), 
a = A«o - JAX + ^ A®u_i, 

6 = iA’h<o-A 3 u_„ 

f = ^A»u_i. 


/, = «,, + X (Auo - iA*Uo + iA*«_i) + X* (i A*i/o - AV.,) + x®|A»m_i 

= Mo+xAuo+^x (x- 1) A*Uo+ lx (x- 1) (x- i) A®u_i 
(where, it should be noted, the last term replaces 

X (x-i) (x-2) . - 
6 ^ ^ 

in the ordinary advancing difference formula). 
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Putting Amo = Am_j + A*m_j and A^^ = A®m_i + A®u_i we obtain 
easily + xAm.i + J (ac^ + jc) A*m_i - ^ (ac* - a:®) A®m_i 

or = Ml + xAmo 4 - ^ (a;** + x) A®m„ - | (a:® - x^) A^Mq, 

which is the form given by King. 

*27. The proof in paragraph 25 above is due to Mr D. C. Fraser, 
and depends on Bessel’s formula. The ordinary interpolation 
formula ending with the term A®m_i can be written in many 
different forms, all giving identical results, and the addition of 


the expression 


A! (ac — l) (2JC — l) 


A®m_i produces an osculatory 


formula. 

Suppose, for example, that we take the descending difference 
formula 

. . . AC (ac 4- I ) . , x(ac''^-i)., 

Mj. = «_i 4- (jc 4- 1 ) Au_i 4- — ’ A^m_j 4- - — -- A*m_j . 


Adding the term - 


AC (ac— i) { zx — i) 


A®m_i we have 


v^ = u_i + (x+ i) Au_i 4- AC (x4- 1) A*m_i 

M-V- - 0 (2V - l)) 


^ X (.c4- i) . „ x^ (x- i) 

= Mi,4-xAu .4- A-m .4- -- ' A®m ,, 

2 2 

which is the form obtained in Lidstone’s proof appended to 
Dr Buchanan’s paper, J.I.A. vol. XLii, p. 394. 

An interesting Note on the application of a graphic scheme to for- 
mulae of osculatory interpolation appears in the Actuarial Students' 
Magazine, No. 3 (Edinburgh, 1930). By treating osculatory interpolation 
as a particular case of divided differences, Mr Fraser shows that a 
diagram similar to the hexagon diagram for ordinary differences can be 
employed to obtain the various forms of osculatory interpolation formulae. 


•28. Modern writers, notably American and Scandinavian 
actuaries, have of recent years turned their attention to the 
construction of formulae of osculatory interpolation. In a paper 
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read by Dr Buchanan before the Faculty of Actuaries in 1929 there 
is an extremely interesting account of the development of the 
subject up to that date (T.F.A. vol. xir, pp. 117-56 and 277-82). 
Of the numerous formulae given in that paper those of Mr R. 
Henderson and Mr W. A. Jenkins are of particular interest. 
Henderson’s formula is 

+ xuj + (x* - I ) (8^1 - ^8*«i), 

where ^=1— x and the 8’s are central difference operators in 
Sheppard’s notation. 

It will be seen that this is Everett’s formula with fourth difference 
adjustments. 

Jenkins’s formulae are also based on Everett’s formula. His 
earlier one is 

- 0 - A?* (f - 0 sx 

+ xui+ ^x (x^- 1) 8X-T2^ (*- i) 5 *Mi, 

where, as above, f = i - x; while his “modified” osculatory inter- 
polation formula (which, in many respects, possesses advantages 
over existing formulae) is 

+ X«1 -I- ^X (x^ — I ) 8 ^Mj — ^X^S^Kj . 

It should be noted that, in general, Henderson’s and Jenkins’s 
curves do not, as do Sprague’s and King’s, pass through the 
given points. They thus involve an element of adjustment as well 
as interpolation. 

The student who wishes to pursue the subject further would be 
well advised to read this paper, with special reference to Buchanan’s 
demonstration of Jenkins’s modified formula and the derivation of 
the formula from first principles given by Mr D. C. Fraser in the 
discussion which followed the paper. 

A comprehensive study of the methods adopted by the latest writers 
on osculatory interpolation will be found in an illuminating article by 
Mr J. E. Kerrich in J.I.A. vol. lxvi (pp. 88-124). The methods of 
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American and English actuaries differ from those of the Scandinavian 
authorities, and in his paper Mr Kerrich discusses the differences 
between these methods and gives a general procedure for the derivation 
of the various formulae that have been evolved. 


EXAMPLES 7 


1 . Show that A™*" — o or « ! according as w > or = ». 

Hence prove that if « and r are positive integers 

(n- !)’■ + «, j) (n-3)’^+...=o or n! 

according as r < or = «. 

2. Use differences of zero to find (2‘75)® and ( — J)^. 

3. Prove that (n + i) A”o". 

4. Prove the identity 

tt! AfthIqII+I 

A"©" + «,1, A^o"-! + A“o—2 + . . . + • 

(W — fn) ! fTl’t l 


5, Show that {ft + 1) A"©" = 2 [A""k>’’ + A"o'’]. 

6. If Su, 8 *m, 8®u, ... represent differences for intervals of i/m and 
Att, Ahi, A*tt, . . . differences for imit intervals, then if fifth differences 
are constant, prove that 


AV A\ 


iomA*o*-A*o^ A*«p 
5I ■ m®'* 


7. Show that / (E") o’" = n”'f {E) o’". 

8. If AC be any quantity less than unity, prove that the limit of the 
series i'' + 2"* + 3"**+ ... to infinity 


(I-AC)= 


Ao" + A V 

I —X 




* ) 
A*o" + . . . 


f- 


9. Prove that the differential coefficient of / (n) with respect to n is 
approximately equal to 

I {/ (« + 1) -/ (n - i)} - yV {/ (n + 2) -/ (n - 2)}. 

10. Find the first three differential coefficients of -^x, when * = 50, 
given the following cube roots : 

■^50 = 3-6840; -^51=3-7084; -^52 = 3-7325: ^53 = 3-7563; 

= 37798: ^55 = 3-8030; 1^/56 = 3-8259. 
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II- M6=I'5S6. «7= 1-690, 1-908, Mj2 = 2-158. 

Find the value of when x = 8, by using divided differences. 

12. Prove that 

\ ^ \ I / \ 

^ ^x—h) V*Iii+4fc ^x-2h) + ^ xMx+Sh ^x-ih) •••• 

13. The first differences of the first differential coefficient of log w, 
are in geometrical progression. Determine the form of u^. 

14. Show that = Ay (jc — f ) approximately. 


By considering the function / (x) = a + ix -f c* and using the above 
relation, prove that log r = rf — approximately, where c is a small 
quantity. 

15. Given that Uo = 5, Ui = is and u^=S 7 < value of 

is 4 when x~o and 72 when a; = 2, find the values of and A^Uq. 


16. 


Show that (i -f log Ey o™ 


rl 


when r > m. 


17- 

18. 


If 8«e = tta+j - Mj_j, prove that 


dx * 24 * 640 


Prove that 


-f - . . .. 

19 . Given 

“ao:i 6 = ^'°®4 “ao;a> = 4'3®4 

«iis:i6 = 6-029 «j5.,jo = 4-346 

“so: 16 = 6-075 

find U23;j7 as accurately as possible. 

20 . Obtain a Lagrange formula for given u^ro, «i;o> “o;i> “i:i- 


21. Show that if = and «o:2 = “2:o> *^®ii 

«*:, = “o:o + (*+j) A,+- *'*^ - ^ A/ Mo:o + *?' [«i:i-«2:o]- 

22. “40:45= i3'i33 “40:60= i2’45® fl4j;4o= 12-880 

“60:40=11-898 a„:„= 12-432 

Find “44:44. 
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23. Given the following premiums for endowment assurances, obtain 
as accurately as possible the premium for age 23, term 17 years: 


Age 

Term 15 years 

20 

5-947 

25 

6-046 

30 

6-144 


24. Find given 

“20: so = 3 ' 1000 

*<20:25 = 3-2625 

*<20:80 = 3-5042 
*<40:36 = 3-8458 

*<26:25=3-5000 


Premiums 

- A - ■ 

Term 20 years Term 25 years 
4-418 3-547 

4-530 


*<26:30 = 3-6875 

*<28:s6 = 3-9542 

*< 30:30 = 3-9333 
*<3o:s6 = 4 -* 4*7 
“35:35 = 4-5500 


25. The following values of / {x,y) are given : 

/( 35 . 55 ) = 10-020, /( 3 S. 50) = 11-196, /( 35 , 45) = 12-019, 

/(40, 55 )= 9-796, / (40, 50) = 10-894, /(40, 45) = 11-641, 

/( 4 S, 55 )= 9-583, /(45, 50) = 10-591, / (45, 45)= u -243. 

(i) Using only six of the above values, find 7(42, 52). 

(ii) Making use of all the data, calculate / (44, 51). 

26. Prove that A (u^v^u'^) can be expressed in either of the two fol- 
lowing forms: 

(a) Aw^ + w ^ Av ^ + v ^ w ^ Au ^. 

(b) A«iAt)j.AWj. + M,t;^Aw^ 4 -two similar terms 

+ M*Ar:jAw,.-l-two similar terms. 

•27. If V/(x)=/(x)-/(x-i), 

prove that 

^"/(*) =/(*)-'*<l)/(*-l)-l-«(2)/(*- 2)-. •• +(-!)"/(*-«). 

•28. Prove that 

f:*«o=[i-(-X(„v+(*+i),„ v* 4 -(*+ 2 ), 3 , v»+... 

• 4 (*- 4 «)(n+l) u^. 



EXAMPLES 

29. If - X {x + In - ■ 

prove that (jc''^ ~ t*) — 2") . . . (** — w — i 

and xt2«+«=:c(*2-i)(*2_|)... [*2_ J (2„ _ i)2]. 

30. Show that 

/W=/(o)+-^r8/(o)+*-, 8y(o)+... + ^ SY(o)+..., 
where has the same meaning as in Qu. 29, and 

¥(*)=/(* +i)-/(*-i)- 
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CHAPTER VIII 


MODERN EXTENSIONS AND SPECIAL 
DEVICES 

*1. In this chapter we shall deal briefly with certain modern 
methods of interpolation and summation. The proofs and applica- 
tions are not intended to be exhaustive, and for further information 
on any of the theorems or methods a study of the original papers 
or notes is recommended. 

AITKEN’S THEOREM FOR POLYNOMIAL 
INTERPOLATION 

•2. Dr A. C. Aitken has devised a remarkable general theorem for 
polynomial interpolation which covers a very wide field, embracing 
all the ordinary formulae as particular cases. The basis of the 
theorem is that when a difference operator is applied to a given 
polynomial of the nth degree in x, the result is a polynomial of 
the (n— i)th degree, and when performed on a constant the result 
is zero. This obviously holds when the operator is A. Similarly, 
when Uj. is differentiated with respect to x, a polynomial of one 
degree less in x is produced; and if .<4 is a constant, then dAjdx 
is o. In this case the operator is D. 

The general operators are 0 and 0. 0 denotes that form of 
inverse operation which produces a polynomial divisible by x, 
so that constants (of integration, etc.) do not enter into the inverse 
processes, which are therefore perfectly definite. 

It is evident, in the first place, that whatever operation is repre- 
sented by 0, 0 1 is X, and the effect of operating with 0, or a succes- 
sion of 0’s, gives a multiple of x. 

From the definition, since Uq is the constant part of u^, which 
is removed by the operation 0, 

00Wj. = «^-Mo 
W* = Mo-|-00M,. 


so that 
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If for example d = A and the interval of differencing is unity, then 

0I=X(J), — ■^( 3 ) ••• 

and generally 

Again, since A (x- i)(n) = (^- i)(n-i) 


0 {X- 0l9 (x- l)(„) = (x- !)(„)+ I, 

according as n is odd or even. 

Similarly, if 9 s D, it can easily be shown that 


0 




= Q9 


n! ' 


«! 


*3. The general formula for determining the polynomial P„ (x), 
being given values when x = o of and of all such results as 
... 9^ ... 9^9yU^, is 

= Wfl + (^i “«) 02 1 + (^A He) 0j © 2 1 + . . . 

+ (9„ ... 02^1 Ho) ©i©, ... 0„i, 

where the expressions in brackets are numerical values, and the 
expressions following them, such as ©i i , ©j 0j i , . . . , are functions 
of X. It will be understood that, in each term, the first operator is 
the one that stands next to the operand « or i, and that the re- 
maining operators are taken in their order from right to left ; in 
this order the subscripts of 9 increase and those of © decrease. 

It should be noted also that the 6’s are not necessarily all the same 
operation, nor need they be commutative. The following proof is 
due substantially to Mr G, J. Lidstone. 

First, let be a cubic in x and = ox® + bx^ -r cx -t d. 

Then since is of the third degree, is of the second degree, 
is of the first degree, and 9y6^9^u^ is a constant. 

Now u^ = Ug+&i9iUj.. 

Hence, putting 9jU^ for u^, © 2^2 fo"" ©i^i similarly, 

9iU^ = diUo-^%9^eiU^, 

= 02^i«o+ ©s^s^a^iHx. 

= constant. 
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There is no term in © in the last identity because the process 
has been continued until x disappears. 

From the last line above, operating with ©3, 

© 3 ^ 3 ^ 2 ^l“a~ © 8 ^ 3 ^ 2 ^ 1*^0 

and since constant we may write this line as 

© 3 ^ 3 ^ 2 ^ 1 *^*~ (®S^2^1*^o) ® 3 ^- 

Substitute this result in the third line: 

020l//^ = 0201«3 + {03020j«o) ©3I. 

Operate on this with ©2 and substitute in the second line : 

= ^l«0 + (^2^1 “0) ©2 I + ('? 3 l 92 ^ 1 «o) ©2 ©3 I • 

Finally, operate on this with ©^ and substitute in the first line : 

“z = “0 + (^1 “0) ©1 1 + (02^1 Mo) ©1 ©2 I + Mfl) ©1 ©2 ©3 I • 

This process can evidently be e.xtended to the general case, so 
that we arrive at the general formula : 

M* = Mo + (01 Mo) © 1 1 + (02^1 Mo) ©1 © 2 1 + . . . 

+ (0„...020iMo)©|02... ©„I, 

which is Aitken’s formula. 

It will be observed that the proof is exactly analogous to that 
of the divided difference formula (Chap. Ill, p. 44). 

• 4 . By replacing the 0 ’s by more familiar operators various well- 
known formulae can easily be obtained; 

(i) If 0 j = 02s03= ... = A, then as shown above, ©, i=x^), 

0j0j,i =X(2) ..., and the formula in paragraph 3 becomes 

Uj = «o A«oX(i) -h A\X( 2 ) + A\x^ 3 ) +... 
which is the ordinary advancing difference formula. 

(ii) If the 0 ’s are all equivalent to the operator D, we have 

x^ 

u,, = Mo + DmoX 4- DX “j + -D^Mo ^ + • •• 


which is Maclaurin’s series. 
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(iii) Let = -«„)/(*— a) so that ^1^2= A*, and so 

a ba 

on; then it can easily be shown that the formula repro- 
duces 

^x = ^a + {x-a) A,Ua + {x-a) {x-b) A^u^+..., 

1/ be 

Newton’s divided difference formula. 

* 5 . These paragraphs are based on the note by Dr Aitken in 
J.l.A. vol. LXi, pp. 107 et scq. In the same volume appears a 
note on Lidstone’s extension of the theorem to interpolation in 
Everett’s form. In thus extending Aitken's theorem, Mr Lidstone 
has introduced an operator. A, which reduces the degree of a 
polynomial by 2, and not by i, as in the general form, and the 
corresponding inverse operator AsA*^. This ingenious modifica- 
tion results in some very interesting identities and renders the 
development of Everett’s series a comparatively simple matter. 
{Ibid. pp. 113-16 and the original paper cited therein.) 

The student who desires further information on the subject of 
Aitken’s formula is advised to read Dr Aitken’s original paper, 
Ptoc. EJin. Math. Soc., Series ii, vol. I (1929), pp. 203 et seq. 
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•6. A formula for the finite integration of a funetion which is 
the product of two other functions is 


An extension of the formula (Chap. VII, paragraph 12) is 
S“x^x = “x^t’x - -^^Wx^^^x+l + -■■■■ 


■■■{a) 

If is a polynomial of the nth degree in x, differences of a 
higher order than n will vanish, and the last term in the series 
above will be a„ 

The series can however be terminated at any stage with a 
remainder term. 
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For example, 

80 that = A (%So,) - AM^Et;,+i 

O'" - 2 (A«*S®^+i) {b) 

If in place of and we write — Au^. and respectively, 
we have 

-S (Am^S®,,+i)= -Au^S*i;,+, + S 

Identity (A) then becomes 

+ S (c) 

By substituting A^^ and for u^. and respectively in (c) 

itcaneasilybeseenthattheremaindertermafter( - 1)” A’'M3.S"+^r'j.4_„ 

be ^ ^ (A»+^«,S»+X.«+i)- 

(See G. J. Lidstone, J.I.A. vol. lxiv, pp. 160 et seq.) 

• 7 . If in formula (a) we replace u^. by A«j. and t'j. by i , and insert 
the limits x and o, we have 

X 

S(AMj. i)=A«jJ:(1,-^‘‘«i(*+'')(2) + ^®«*(* + 2 )( 3 )-- — 

0 

But E (Au^) = Uj. — C, where C is a constant 
= u^-Ug, say; 

«x - Wo = ^(1) -{x+l)(i) AX + (* + 2) (3) A®W:r " ■ • • 

or «0 = M* -x,[, Am^ + (jc 4- i)( 2 ) ax - (x + 2)(3) AX + ■ ■ • 

which is the ordinary advancing formula for Uq in terms of and 
its differences. An alternative form for is 

^U^Vx = «x-i2^»x - A«x + A2«^_3SX - ('^) 

(See Examples VII, No. 18,) If we make the same substitutions 
as above, we shall have 

«x - «o = ^( 1 ) -^^Wx-i - -^ca) ^X-2 + ^(8) AX-3 - • • • 

or Uo=^x-X(i) Am^_i + x,2) A*^^, - A«m^ + . . ., 

an interpolation formula involving backward differences. 
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To distinguish between the two expansions for we shall 

therefore refer to formula (a) as the “forward difference” summa- 
tion formula and formula (d) as the “backward difference” 
summation formula. 

•8. These formulae can be adapted to the problem of calculation 
of when numerical values of and are available. Mr 

G. J. Lidstone has shown that by making some simple substitu- 
tions the arithmetical work is reduced to a minimum. 

The proof below follows Lidstone ’s proof in J.I.A. vol. Lxx. 

Let iSj. denote the indefinite finite integral of u^v^.. Then can 
be expressed in either the forward or backward form as given 
above. If we wish to find the sum of n terms we must calculate 

— .S'q, and if in the formula we make either or vanish by 
so arranging the summations as to give zero values to the particular 
values of the sums l,v, ..., we reduce the definite sum to 

a single expression, iS„ or — S^. Further, from the zero values we 
can build up the whole table of sums by using the fundamental 
relation 

Since in either the forward difference or backward difference 
summation formula we may choose our values so that or Sg is 
zero, there will be four forms of tabulation. For the purposes of 
illustrating the process it will be sufficient to use one of these 
forms : they are all illustrated in Lidstone’s paper, loc. cit. 

•9. Consider the simple case in which the following values are 
given : 

*0 23 

Uj 27 64 125 216 

^ 3 7 5 

Here n is 4. By simple arithmetic 

= 27 + 192 -h 875 + 1080 = 2174. 

(In this example has been taken as (.x + 3)® so that third differ- 
ences will be constant: the values of are quite arbitrary.) 
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The difference table is 


X 





0 

27 

37 



I 

64 


24 




61 


6 

2 

125 

91 

30 


3 

216 





We will use the backward form, making 5^ = 0 . 

Thus, to use (the last term — in this case m^) and its backward 

differences, we make o = = . . . for a; = o ; and in that case 

n— 1 

0 

Construct the following table, where the sums of are taken 
downwards and stepped down one line at a time: 

* i't’i 'L‘‘v^ 

0 I o o o o 

1 3 I o o o 

274100 

(n-i) 3 5 II 5 1 o 

(n) 4 — 16 16 6 I 

n— 1 

Then, substituting in the formula for Z u^v^y the required sum 

0 

IS 

+ 216 X 16= +3456 

— 91 X 16= — 1456 
+ 30 X 6= + 180 

— 6 X I = — 6 

+ 2174 

which agrees with the result found by direct calculation. 

•10. The methods adopted in taking any of the other forms are 
similar, and the student will find it instructive to use these other 
forms in further examples of the same type. 
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It should be noted that, by combining two forms, we may take 
the sums from some convenient central point and so reduce the 
figures involved in the summations. 

These formulae and methods are of great use in forming statis- 
tical “moments” of the form 


INTERPOLATION BY CROSS-MEANS 


*11. The divided difference formula is 


= + /h«a + (^-«) (x-b) 

b be 

If we are given only two values of 1 /^, namely and then 

“i = “a + (^-^) A«a 
b 

= u^ + {x-a) (i/fe - u^)l{b - a) 
b—x x—a 


a — x 
b-x 


-a. 


This is a blend of and in the proportions {b — x)j{b — a) and 
(x — a)j{b — a) and, from the determinant form, we may note that 
a first difference approximation to is in fact a cross-product 
divided by the difference between the two given arguments. This 
is called a linear cross-mean. 


•12. Dr A. C. Aitken has evolved a method of interpolation by 
cross-means which enables simple arithmetical processes to be used 
without the need for either a formula or differences. 

The following is an elementary description of the method. 

If in the general divided difference formula we substitute 
successively a, b, c, ... for x giving one, two, three ... terms, we 
may write down the following table : 
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No. of 
terms 

One 

Two 

Three 

Four 


“o = «a 

+ AWa 

b 


fr be 

Ud = Ua + id -a) AUa + id-a) (d-b) 

b be 

+ {d-a) {d~b) {d-c) /\^Ua 


bed 


Part* 

a~x 

b — x 


c — x 


d — x 


where the column headed “parts” represents the factors to be 
used later. 

The first step in the process is to find first difference approxima- 
tions based on and Uj , and u^, and ... as in paragraph 1 1 . 
These may be denoted by (a, b), (a, c), {o, d) .... In order 

to obtain these approximations we use the parts in the last column 
above, and it is evident that the first two terms of the general 
formula (i.e. Ua-i-(a:-a) i\Ug) appear in all the approximations. 


Thus 


u^{a,d) = 'Ug a-x 
I Ud d—x 


h(d-a). 

I 


The first difference results are therefore of the form 


Ug-\-{x-a) AUg + {x-a)ky, 
b 

where ki represents the multipliers of the common factor (* — a) 
in {a, b), {a, c), {a,d) .... 

We may construct a table similar to that above, the expressions 
in the middle columns giving the various values of : 


«x («. b) 
(«. 

M* («- d) 
{o, e) 


O . 

be 

(c-b) AX 

(d - b) AX +{d-b){d- c) AX 

be bed 

(e - b) A®Wa -I- (e - ft) (e - c) AX 

be bod 

+ {e-b) (e-c) (e-d) AX 

bode 


b — x 
c — x 


d — x 


e-x 
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The second difference approximations are found in the same 
way by the use of the parts as stated, and the results are of the 
form 


^x = Wa + {x-^) AUa + ix-a)(x-b) A?u^} + {x~a) (x-b)k^, 

b be 


the first three terms being common to all the values, and the 
respective values of being those in the middle column below: 


Uj, (a, b, c) 
{a, b, ( 1 ) 

»x («. «) 


o.A^u^ 

bed 

{d-c) 

bed 

(e - r) + (e - f) (e - d) 

bed hcrU 


C—X 


d — x 


e — x 


and so on. 


• 13 . An example will make the method clear. In order to simplify 
the arithmetic equidistant intervals have been used, but the 
processes are, of course, perfectly general. 

Example 1. 

Find Uj. when * = -7352 from the following data: 

X -2 —I o 1 2 

15^49 16218 16596 16982 17378 

We require the following table : 


a 

"a 


b 


(a, V) 

c 

“0 

“x (a. “x (a. c) 

d 


“x (a, d) {a, b, d) (a, b, c, d) 

€ 


“x(a, e) u^{a,b,e) u^{a,h,c,e) 

where the parts 

are -27352, -17352, - 7352 . -2648, 1-2648. 
/ b — x x — a 

Now 



where 


6 -x = - I 7352, 
a-x^ -2-7352, 


«.= 15849. 

= 16218, 
h — a = i. 


and 
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A (a, i) = i5849X -17353- (16218 x -2 7352) 

= 16858-3. 

The calculation ia performed very quickly by the aid of an arithmo- 
meter. 

Similarly, 

H,(a, c) = J [(15849X -7352 )~(i6s96x -2-7352)], 


since 

c — a = 2, 

and so on. 

= 16870*6 

Again, 

“x (a, b, c) = ^ _ ^ M, (a, &) + ^ 

where 

--7352, 


b-x== -1-7352, 
(u, 6) =16858-3, 
(a, c) = 16870-6, 

and 

h~a=i\ 


Uj, (a, b, c) = 16879-6. 
The completed table is as under. 


X 

— 2 

— I 

15849 

16218 

16 1 858-3 


0 

16596 

■ 870-6 

87 ' 9-6 

I 

16982 

, 882-0 

1 8-9 

2 

17378 

1 894-3 

i 9-1 


In practice, -work is saved by omitting common figures in the columns, 
as in the calculations of the cross-means these figures will merely be 
repeated. 

The final result is 16879. 

The gradual closing-up of the approximation is clearly seen. It is 
best to arrange the u’s in the order in which they come into Gau.ss’s 

formula, namely, «g, Uj, tt_j, In this way quicker convergence 

is secured. 


• 14 . A further adaptation of this method is by the use of quadratic 
cross-means. Here the values of u^. are taken in the order u_„, 



QUADRATIC CROSS-MEANS 


169 


u_fi, Uj, ... and the divided difference formula becomes 

= «-a + (^ + «) A M-a + (■*■* - a^) A®M_a 

o a,-b 

+ (A;®-fl*) (jc-l-i) A®M-a+--” 

a,—b,b 

The first stage in the calculations is interpolation to first differ- 
ences by linear cross-means as before; at each subsequent stage, 
by an arithmetically similar process, two further differences are 
allowed for simultaneously. 

Dr Aitken’s method of application is as follows ; 

(«. - a) = [(^ + «) “a + (« - *) «-a]/2a- 
Since this expression is unaltered by substituting —a for a, it 
is an even polynomial in a, say (a^). 

By letting a = xinu^ (a, — a), we have u^{Xy —x), which is 
The problem is therefore reduced to interpolating for (a:*), 
given v^(a^), t'j. (b^), •••• The repeated cross-means are 

therefore available, beginning with 

(a*, b^) = [(6* - x^) V, (a*) - (a* - x*) t;, (6*)]/(&« - a“) 


and similarly for (a^, c-) .... 

Again, 

(c- - X-) («2, 62) _ (fts _ ^-2) (a2, c2) 


z'x (a^, 62, c2) = - 


and so on. 

It is evident now that by interpolating on variables of the form 
r2 we are moving by two orders of differences at a time, which 
improves the convergence to a marked degree. Further, with a 
central origin a, 6, c ... are 5, 1-5, 2-5 ... and the factors 
a^ — x-y 62 — ^2, c2 — x2 ... all have the same decimal part. Also, 
the divisors b^ — a^y c^~a^y c^ — h^ ... are all even integers, which 
enables halved cross-multipliers and divisors to be used. Where the 
arguments are -5, 1-5, 2-5 ... these halved divisors become i, 3, 6, 
10 ..., the coefficients in the ordinary binomial expansion. 

Finally, for these arguments the first multiplier is i J — x-) of 
I (^ — x) (^ + x), which is one-half the product of the multipliers 
used in the first linear cross-mean for v (J). 

An example will help to explain the method. 
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Example 2. 

Using the data in Example i, together with «3= 17773, interpolate 
to find M.73J2 . 

The extra value is required, as in the quadratic method we need an 
even number of terms. 

The working is as follows : 

The decimal part of the multiplier is 

i (7352 X -2648) = -0973405. 

The v’s are 

■7352x16982+ -2648 X 16596= 16879-8 

1- 7352 X 17378 + 1-2648 X 16218= 16888-9 

2- 7352 X 17773 + 2 2648 X 15849 = 16901 '5 

The next steps are 

879-8 X 1-0973405 — 888-9 X -0973405 =878-9 
879-8 X 3-0973405-901-5 X -0973405 = 879-1 

Further values of v are unnecessary, and the table, similar to that for 
linear cross-means interpolations, is 

16 8798 -0973405 

888-9 87 ' *^-9 1.-0973405 

901-5 I 9-1 3-0973405 

Hence *<.7352= 16879, 2s before. 

* 15 . The cross-means methods possess particular advantages. No 
interpolation formula and no differences are required and we may 
stop the calculation at any point, so that any required degree of 
accuracy may be attained. For equidistant intervals the quadratic 
method is much to be preferred. When the intervals are unequal 
the linear process has considerable practical value ; it is, in fact, one 
of the most direct and effective methods for inverse interpolation. 

Generally, the quadratic method is very advantageous for direct 
interpolation. It can also be readily adapted to inverse interpola- 
tion. When the arguments are equally spaced and the calculations 
can be effected by the aid of an arithmometer this device enables 
interpolated results to be obtained much more rapidly than by any 
other method. For instance, in Example 2, after the multiplier 
has been calculated, the successive values of v can be obtained on 
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an arithmometer without clearing the machine after each operation. 
After the method has been mastered simple interpolations can be 
obtained in a very few minutes. 

• 16 . This part of the chapter is based largely on the note by Mr 
G. J. Lidstone, “Aitken’s new processes for direct and inverse 
interpolation” in J.l.A. vol. Lxviii, pp. 272-86. In this note 
there is a full and lucid description of the methods of interpolation 
by cross-means with a number of fully-worked examples, and the 
student will find it instructive to compare the advantages and 
disadvantages of interpolation by difference formulae and the 
methods of cross-means, by working out actual numerical questions. 


COMRIE’S “THROW-BACK” DEVICE 

• 17 . In the same paper (loc. cit. pp. 286-93), Mr Lidstone gives 
a comprehensive account (with references) of the throw-back derice 
of Dr L. J. Comrie, based on Bessel’s or Everett’s formula. This 
process is of great advantage when the interpolation is to be taken 
beyond third differences, as it shortens the work and reduces the 
number of differences to be recorded in the I’ables. The nature of 
the process is indicated in the next paragraph ; for a fuller account 
reference may be made to the paper mentioned above and to 
Comrie’s papers cited therein. 


* 1 8 . In Bessel’s formula (Chap. IV, paragraph 6), let the coefficients 
of the odd differences be called B^u, B'^u ..., and the coeffi- 

cients of the mean even differences S''m, B^u .... Then for 
the practical range of interpolation x = o to a:= 1, it is found that 
the ratios 




B'^u 

B'u'’ 


and 




change so slightly with x that they may, with sufficient accuracy, 
be assigned constant mean values provided that higher differences 
are not too large. 
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Adopting these values the formula takes the convenient form 

«a! = Mo+‘^'^“o + ^ [A* — •i84i*4--o38A® ...]«o 

[A,--io8A5 + -oi6A’ 

where Am, A®m, A®u ... represent the odd central differences and 
A^, A*u, A*m ... the mean even central differences. 

The quantities in the square brackets are evidently independent 
of X and may therefore be tabulated. 

The throw-back device may be used with great advantage in 
double-entry tables (i.e. functions of x and jy). Its use curtails 
very considerably the complicated interpolation formulae otherwise 
required when even moderately high orders of differences are 
involved. 

The device is also very useful for inverse interpolation, since it 
enables differences beyond the third to be included by a direct 
process without increasing the degree of the equation to be solved. 
Further, the last coefficient, J5'“m, is very small and changes slowly, 
so that allowance is easily made for the last term in the formula 
given above. 



CHAPTER IX 


APPROXIMATE INTEGRATION 

1. In order to obtain the area of a curve by the methods of the 
integral calculus two conditions must necessarily hold. These 
conditions are 

(i) the equation of the curve must be known ; and in that event 

(ii) the function y = representing the equation of the curve 

must be integrable. 

In the theory of life contingencies these conditions are rarely 
satisfied. Rates of mortality, marriage, etc. are generally obtained 
from actual observations and the functions derived from these rates 
are seldom capable of e.xpression in the form of a mathematical 
expansion. For example, if 1^. be the number of persons attaining 
exact age x in any year of time, the number living between ages x 
and jc + I is given by 



where x and t are independent. 

Unless follows some definite mathematical law, we cannot 
evaluate the integral by the methods hitherto employed. By making 
certain assumptions, however, we can obtain approximations to 
the value of the integral which are accurate enough for practical 
purposes. 

Thus, if we expand 4^, in terms of 4 and differences of 4, we 
have 

'« 

= 4+^‘^(r to first differences. 
But A4 = 4+1 ~ ^jri 

and 4~4+i is ‘4> the number out of the 4 Pteons who die 
before reaching age jc + t . 
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■= f dt as far as first differences 

Jo 

=4-H4-U) 

= i (4+4+i)- 

This is an approximate value of on the assumption that l^+i 
is a first difference function of /^. 

It is evident that a simple approximation to first differences will 
not be justifiable except in a limited number of instances: the con- 
struction of formulae for approximate integration will necessarily 
depend on the data available and on the degree of accuracy required. 

2. Simpson’s rule. 

Suppose that we are given two values of only and that we 
represent these two values by means of the points H and K 
whose co-ordinates are (a, m„) and {b, Uf,) respectively. Suppose 
also that we have sufficient information to justify our represent- 
ing the function y = by a curve of small changes of slope, 
not affected by periodic changes. 

An infinite number of such mathe- 
matical curves can be drawn to pass 
through H and K. The simplest 
curve passing through the points 
is a straight line, and for the pur- 
pose of interpolation it is often 
sufficient to adopt a first difference 
formula, as was done for example 
in obtaining the approximation to 
in the preceding paragraph. If 
the straight line be drawn, the area 



Fig. 3. 
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cut ofiF by the curve, the x-axis and the ordinates x=a and x = b 
will be that of the trapezium HNMK. The area of the trapezium 
is H«a + «4) 

Accurately, this area is j" Uj.dx. 

A first approximation to the integral is therefore + — a). 

If _y = u^, = l^+i and the limits b and a are i and o respectively, 

we have the approximation to f 4^^ dt given above, namely 

J 0 


i iL + ^x+l)- 


Let us now improve the approximation by introducing a third 
known value of u^. In other words, suppose our data to be a^, u^. 
Then there is still an unlimited 
number of curves which can be 
drawn to pass through the three 
points H, K, L. The simplest of 
these curves will now no longer be 
a straight line, but a second differ- 
ence curve. The form of this curve 
will be y = A + Bx-I- Cx^, and since 
three points on the curve are given 
we are in possession of sufficient 
data to determine the coefficients 
A, B, C. 



a 


Ma 


«6 


k -- b > 


c 

F>g- 4- 


Ue 


In practice the data will generally be available at equidistant 
intervals, and since we may choose our origin where we please, 
the problem is therefore reduced to one of finding the second 
difference curve which passes through the points (o, «o), (i, Mi), 
( 2 . “a)- 

Having found the equation of the curve we can then find the 
area enclosed by the curve, the extreme ordinates and the x-axis, 
by integrating the function thus obtained between the limits o 
and 2 . 

Assume therefore that y*=u,‘=a + bx + cx* is the required 
equation. 
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Then, by substitution, 


«! = a + A + c, 
ttj = a + 26 + 4c, 


whence 


a = Un 


A = H - «i 2 + 4“i - 3“o) = ^ A*«o. 

c = I '(u^ - 2Ui + «o) = 2 


Also J u^dx= j (a + iA: + cx^) dx 


= ax + lhx^+ Icx^ I 


■I 


= 2<2 + 2 A + |c 

= M“o + 4 “i + “ 2 ). 
on substituting for a, A and c. 

This is Simpson’s rule for approximate integration. 
We can obtain this formula by alternative methods. 
For example, let 

«j. = (i +A)® Uo=«o+a;Amo + ^jc (*- i) S'\ 
as far as second differences. 

Then f Uj.dx=[ {uQ + xS.UQ + lix{x—i)S}u^dx 
Jo Jo 




= {«o + X (Amo - 1 AX) + tix 


= j^ocMo + (Aug - iA\) + \xPA\ 

= 2«o + 2 (Amo - ^ AX) + 

= 2 U(,+ 2 AUo+|-AX, 
and since Ug = Ug, 

«i = Mo + Amo, 

«2 = Wo + 2 AWo + AX, 

this reduces to | (ug + 42/^ + ^2) as before. 


.1 


2 

Jo 
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A third method is to assume that the integral can be expressed 
in the form 

mufi + nui-^ pu^. 

Then if = a-\-bx + cx^ we have eventually that 
'2 

Uj.dx=2a + 2b + fc, 


j: 


as in the first method. 

Substituting in the assumed expression for the integral: 
ma + n [a + b + c) {a-^2b + ^c) = 2a + 2b + |c. 

Whence, by equating coefficients of a, b, c and solving the 
resulting equations. 


m~ 


_ I . 


Tf» 


«=|; 


= i 


3. Change of unit. 

The formula J Uxdx = \{u^ + 41/5 + is an approximate formula 

obtained by considering unit intervals. If we wish to transform 
the formula to a form in which the interval is changed to, say, n, 
then the given values of will be u^\ u„; u^n- Our new variable 
is z, where z = nx. 

.’. dz = ndx. 


The new limits are evidently o and zn. 
The formula is 


r2nj 

Jo n 


«.<^* = H“0 + 4 »n + “*n). 


l.C. 


r2n 


or, on changing the variable to Xy 


r2n 

1 ^ «^d*=Jn(ao+ 4 «„+«a,»). 


This principle is of universal application and any formula of 
approximate integration can immediately be transferred from unit 
intervals to nthly intervals and vice versa. 


VM A8 


12 
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For example, the approximate formula 


becomes 


^10 

/•20 

1 ^ = 5 («2 + «8 + Wl 2 + « 18 ) 


on doubling the interval ; 


pOn r„ 


when the interval is n. 

4. Change of origin. 

Consider again Simpson’s rule: 


u^dx^\ (uo + 4Mi + ttj). 


We have obtained a formula which gives in terms of m, and 
the area of the curve cut off by the ^ 
ordinates a: = o and a:=2. If we y. 

change the origin so that the point ’ 

(i, Ml) becomes the point (o, Vg), as 
in Fig. 5, we shall have a formula 
of integration between the limits — i 
and I in terms of v_i, Vg and v^. ^ v., ^ 

The formula will be otherwise un- ’ ’ 

altered and we shall have found an 
approximation to the same area with 

reference to a new j-axis Oj Fj. [_ 

In its new form the formula be- ^ ^ 


J v^dx = \{v_i-^a,Vg-\-v^, 


or, changing back to u’s. 


J = i («-i + +“o + «t)- 
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Now when it is desired to obtain an approximate integration 
formula it is evident that integration between — i and i (or between 
— n and n) involves much simpler algebra than integration between 
*0 and 2 (or between o and zn). Thus, if 

u^ = a + bx^- cx®, 
then = 

= a — 6 + c, 

«, = a + fe + c, 

and i (u_i + 4U0 + Wi) = J ( 4 «o + («_i + Ml)} 

= ] (4a + 2fl + 2c) 

= J (6a + 2 c). 

Also j* (a + 6 x + cjc®) dx = + iix® + I r x® J 

= 2a + fc, 

which proves the approximation. 

Again, since in this form the coefficients of odd powers of x 
disappear in the definite integral and also in the paired terms u, 
and M_,, we could equally well employ as our assumed expansion 
for Uj. the third difference function a + bx-\- cx® + dx^. 

We should have 

4 M(, -\-{u_i + u^ = ^a + a-b + c — d + a'^b + c + d = ba + zc 
as before, and 

4- + \cx^ + = 2 a + f c. 

This leads to the important fact that Simpson’s rule is true to 
one more order of differences than was originally assumed, i.e. 
to third differences. 

Generally, forjnulae involving 2r + i terms, placed symmetrically 
with reference to the centre of the range, are correct to (2r-f-i)th 
differences. 

For the above reasons, namely 
(i) for simplicity in working; 



i8o 


FINITE DIFFERENCES 


(ii) to enable us to find the true order of differences to which 
the approximation is correct; 

it is advisable to integrate between — n and n in preference to any 
other limits. 

This can always be done by a suitable change of limits; for 
example, integration between J and i can be simplified to integra- 
tion between, say, 2 and 4 by increasing the interval from ^ to 2. 
Then, by changing the origin, a formula can be obtained for inte- 
gration between — i and i . To express the formula in the required 
form the process can then be reversed. 


Example 1. 


r I 

Show that J v^dx = (vj + 22f>j +^3) approximately. 


I 


2-5 j 

v^dx^ - 

1-5 2 


Ujdx (if we double the interval). 


This will produce a formula in 1/4; Ug. 

In the first place we will obtain a formula for j u^dx. 

This formula will have to be of the form + ««o +/>«2, since 
we have moved our origin to the point where x was originally 4. 


Let 

Then 

and 
so that 


u^ = a+bx + cx^ + dx^. 

Uo = a, 

tt _2 = a — 26 + 4c-8d, 
u^ = a + zb+^ + 8d, 

^ («_2 + 22tto + «2) = (22fl + za + 8c) = fl + Jr. 

1 




u,dx = 


! + + =za + lc. 

^ (tt_j + 22ao + Mg) = M u^dx. 

24 2 j_i 

Reverting to the first origin ; 

I I f ^ 

- (tt, + 22«« + U,) = - U^dx. 

24 2^3 
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Moreover, although only three values of w* have been given (so that 
only second differences are known), the formula is true to third 
differences, for it is symmetrical about the middle point. 


6. Some well-known approximate integration formulae. 

(i) Extension of Simpson’s rule. 

We have f u^dx = \ (u„ + + Mj). 

J 0 

Therefore by changing the origin 

'4 


j; 


Uidx = ^ (u^ + ^u^ + uf). 


r® 

Similarly J u^dx = ^ (u^ + 4.u^ + u^). 


In general, by addition, 

j Uj.dx = ^ (Uo + 4 Mi + 2U2 + 4U3 + 2«4 + ... +M2„). 

This is a formula for approximate integration used extensively 
in practical mathematics. In engineering problems, where the 
curve to be integrated has actually been sketched as the result of 
experiments, a small unit may be chosen and a large number of 
ordinates drawn. In that event the extended formula can be ap- 
plied. This will in general give better results than the single formula 
r 2 n ^ 

M^rfx = -(Wo + 4«n + “2n)- 
Jo 3 

It should be noted that Simpson’s rule, applied in sections (as above), 
does not assume that a smooth curve can be drawn between all the 
points Wq, Uj, ... U2„. The method of obtaining the formula has in fact 
been to draw a number of disjointed curves between «q, Mj, u^ ; u^, u^, ; 

... «2n-i> “2n. ^nd the curve passing through three points such as 

«o, Mj, Mj will not as a rule pass through the next series Wj and u^. 

(2) The “three-eighths" rule. 

The following symmetrical formula can be derived by working 
on the above lines, when four consecutive points are given. 
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If the four points were M2, Mj we should commonly in- 

tegrate between o and 3. Change the unit and origin so that we 
pass firstly to the limits o and 6 and secondly to the limits — 3 and 3. 

We require /: u^dx, given «_3, u_ 1, Mj, Mg. 

Let Uj. = a + bx + cx^ + dx^, 

so that u_i + Mi = 2 (a + c) 

and m_ 3-|-M3 = 2 (fl + qr). 

If J u^dx = m(u_^ + Uj) + n{u,_s + Ui), 

we have 

+ ^bx^ + ijcx^ + \dx^ = »i {m_ , + Mj) + n (u_g -I- u-i) ; 

i.e. 6 m + i8c = m (M_i + Mj) + n (m_3-I-M3) 

= OT {2a + 2c) + n (aa+iSc) 

= a (2m + 2n) + c (2m + 1 8n), 
from which, easily, m= 18/8 and m = 6/8. 

J ^U,<i* = |{6(M_i-t-Mi)-|-2(M_3 + U3)}, 
r® 

i.e. J^Mj<ir = |{6(z/2 + U4) + 2 (Mo + Mg)}, 

or u^dx = I {3 («i -I- Mg) + (Mg + M3)} 

= I(«o+3“i + 3«2 + “s) 

= 1 (H-£) 3 mo. 

This formula, like Simpson’s rule, is correct to third differences. 
(3) Weddle’s rule. 

Suppose that seven equidistant ordinates are given. In order 

to reduce the algebra to a minimum, we integrate between limits 

-3 and -1-3, and write the assumed formula for u^ thus: 

„ ^ , L„ , , dx {x^- i) e-vi* i) fx (*2- i) (*2-4) 

u- = a + DX-r — I 7 r — 1 

* 26 24 120 

i) (*2-4) 
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This is Stirling’s formula with the constants a, b, c, 
placing the differences of u^. 

Then 


183 

... g re- 


, r bx^ cx^ d jx* e [x^ x^\ 

J-z L 26 6\4 z] 24\s 3/ 


4- -Z- _ 5^ 4 _ /f! _ ^ + 4^*\T 

120 U 4 2 / 720 1, 7 5 3 

= 6a + 54 ~ + -^ W 486 U + g 

\6 72 540/ U20 720/ 280* 


^6a + gc + ^e+^g. 


Replacing a, c, e, g by the differences of in Stirling’s formula, 
we have 

dx = 6 a + qc + lle + ^ 

= 6u„ + 9 A*i/_j + *-5 A^m_2 + A*m ,3. 

If now A^m_i, A*u_2 and A®k_ 3 are expressed in terms of w.g, 
«_2, ... Wj, «3, we shall obtain a formula correct to seventh differ- 
ences. It will be found, however, that the coefficients of the terms 
are large. This is due to the awkward fraction multiplying 
A*u_3. As sixth differences are usually small, the error involved 
in replacing ^^A®«_3 by /j“g^A*«_3, i.e. by A®w. 3, will, in general, 
be negligible. By this substitution the coefficients in the final 
formula will be much simplified. The modified formula will 
involve an error of YJo'A®u_g and will therefore be strictly correct 
to fifth differences only but virtually correct to seventh differences. 

The terms in the expression thus adjusted, namely, 


6«(, + 9A^_i + A*u_2 + ^ A»k_3, 

may be expressed in terms of the u's and collected as shown in 
the following table; 



«-8 





Ui 

W .1 

6 uo 

9 A“u_i 

• 3 A*u_s 

■3 

3-3 

- 1-8 

9-0 

- 13 2 

4 5 

60 

— x8o 
IQ’S 

— 6-0 

90 
- I 3'2 
4-5 

33 

-1-8 

■3 

Total ... 

•3 

1-5 

■3 

1-8 

•3 

I 5 

3 
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Therefore J u^dx is approximately equal to 


•3 m_ 3 + i- 5 m _2 + -3 u_i+ i- 8 mo+- 3 Mi+ i- 5M2 + o- 3 Us 

= Tff [(«-s + “a) + 5 (“-a + “a) + («-i + «i) + 6 mo] 
or j^u^dx = -^ [(“o+« 6) + 5 (“i+“6) + («a + «4) + 6«s]- 

This is Weddle’s rule. 

An alternative method of obtaining Weddle’s rule is by combining 
a 7-ordinate Simpson’s formula with a 7-ordinate three-eighths formula 
in the ratio 9:— 4. This method proceeds by eliminating the principal 
error terms in each of the two formulae involved. 

(4) Hardy's formulae. 

Certain approximate formulae due to G. P'. Hardy are used 
extensively in actuarial work. 

Let u^ = a -\-bx + cx^ + dx^ -I- ex* + /x® 


so that 




6 a+ i8c-f--?-Se; 


Uo~a; a_2 + H2 = 2a + 8c + 32e; «_3-l-M3 = 2fl-f i8e-)- 162«. 

Solving for a, c and e, and sub.stituting: 

J U^dx = 2-2Mo -I- 1-62 (W_2 + “s) + 28 («_3 + U3) 

or J Mj.dx = 2-2M3-|- 1-62 (Mi-|-«3)-t--28 (Wo + Wg) 

= •28 («o-l-Me)-l-i-62 (a, -I- Mg) -t- 2-2M3, 

which is Hardy’s “formula (37)’’. 

If the interval of differencing be n, this becomes 


rOn 

Ujx = n{-2S («o + »<8„)+I-62 K + M 5n) + 2-2M3„}. 


Similarly 


ri2n 

U^dx = n {-28 (Mg„ + «,2„)-|- 1-62 (K7n + «lln) + 2-2tt3„}, 

J 6n 


and so on. 



Since 
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^ao p6n ^I2n /*18n. 

I u^dx=\ Uj.dx+\ u^dx+\ u^dx+..., 

Jo Jo J On J 12n 

00 

1 ^ U^dx = n (-28 (Uo + 2Ug„ + 2Ui^ + ...) 

+ 1-62 (u^ + Usn + th„ + ---) + 2-2(u 

3n ^dn * • •)}• 


This is Hardy’s “formula (38)”. 

If we are dealing with functions derived from a mortality table, 
and we choose n so that yn falls just within or just without the 
limits of the table — so that yn is in fact about too — we can write 
the above formula thus: 


/: 


u^dx = n (•28«o+ f62«„ + 2-2«3„+ I-62M6„ + -56 m8„4- I-62M7„). 


In this form the formula is known as Hardy’s “formula (390)’’. 
(See G. King, Life Contingencies, p. 488.) 


6. (i) Since all these approximations give in eflrect the area of 
the curve bounded by u^, the limiting ordinates and the x:-axis, 
they are often termed “quadrature formulae’’, the word “quadra- 
ture’’ being defined as the exact or approximate calculation of the 
area of the square equal in area to that of the given figure. 

(ii) In any of the foregoing formulae the sum of the coefficients 
must equal the range of integration, where this is finite. This may 
easily be seen by making all the w’s involved equal to the same 
constant. 


7. Practical applications of the formulae. 

Integration formulae of the above type may be used to advantage 
in obtaining approximations to the values of certain complicated 
forms of integral which occur in the theory of life contingencies. 
Where the functions involved are such that the summation extends 
to the end of the life table, it is customary to calculate the values 
of the functions by Hardy’s formula (39a). If, on the other hand, 
the upper limit is at an age-point short of the limiting age of the 
life table, any of the simpler integration formulae can be employed 
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to advantage. (See, for example, Spurgeon, Life Contingencies, 
pp. 262, 263.) 

It is evident that we can neither integrate the function u^. nor 
interpolate between given values of if we are absolutely without 
information regarding the value of the function between the 
ordinates. For example, in a mortality table we may integrate 
between, say, and /31, assuming first differences constant, for we 
know that in general the decrements between ages 30 and 31 are 
small and may be fairly considered as being evenly distributed over 
the interval. We could not reasonably adopt the same assumption 
for interpolation between and /„ since the deaths of infants in 
the first year of age are not evenly distributed over the year. Again, 
a reliable estimate of the population of a seaside resort in June of 
any calendar year would not be obtained by a first difference 
interpolation between the population figures for January and 
December of the same year. In the two latter illustrations further 
information would be necessary before we could proceed to inter- 
polation or integration. 

In applying the formulae of approximate integration to the 
solution of a problem it is therefore essential that we have sufficient 
knowledge of the function to justify our assumptions regarding 
the nature of the curve passing through the given points. 

When the function to be integrated is one in which the neglected 
differential coefficients of the curve are small between the limits 
of the integration, almost any of the above formulae will give 
satisfactory results. If these differential coefficients are not small, 
i.e. those of lower order are changing rapidly between the limits, 
there may be considerable differences between the approximate 
results. 

Example 2. 

The formula for a continuous annuity-certain is 



where 8 is the force of interest corresponding to the rate of interest i at 
which the annuity is valued. 
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By means of an approximate integration formula find the value of 
ajl at 4 per cent. 

Since we have to evaluate we shall require some or all of the values 
of Uf from t — o to t = 6 , where a, = e~^‘. 

By a well-known formula in the theory of interest, 

where t; = (i -h i)“i. 

Values of at 4 per cent, are available from any tables of interest. 
We have, therefore: 

= =1, 

= =v = -96154, 

U2 = e~^=v^= -92456, 

U.J — e~-^ = v^= -88900, 

«4 = e~** = t>''= -85480, 

= = -82193, 

ttj = e-** = ?;« = -79031. 

The following results will be obtained : 

(i) Simpson’s rule applied three times ; 

I («o-t-4Wi + 2tt2-t'4a3-)-2«4 + 4Uj-)-ii,) 

= 5-34^'30- 

(ii) The three-eighths rule (with interval of differencing 2): 

l{2 («^■f3«2 + 3«4-^-««)} 

= 5-34629 

(iii) Weddle’s rule: 

•3 K-l-5“i + “2 + 6«3 + M4-f 5"6 + We) 

= 5-3463 1- 


t/'-o® 


1*6 

e~^‘ dt will be 

— 1 e~^‘ 

.lo 

8 


, and since the value of 8 at 4 per cent, as given by an interest 


table is 039220713..., the integral becomes 


1- -7903145257... 
•039220713... 


5-34629.... 


The three approximate formulae all give results which differ very 
little from this. The excellence of the approximations is due to the fact 
that between the limits t = o and r = 6 the higher differential coefficients 
of 3/ = tt( = e~^* are very small and the range is short. 
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8. The Euler-Maclaurln expansion. 

We have considered quadrature formulae in which the integral 
is expressed in terms of a number of particular ordinates. All 
formulae of this type can be deduced in a straightforward manner 
by means of Lagrange’s interpolation formula, as shown at the 
end of this chapter. We will now proceed to investigate the pro- 
blem of approximate integration more generally. 

The basic formula for expressing a definite integral in terms of 
given ordinates, with adjustments, is the Euler-Maclaurin expan- 
sion. This expansion is in effect of similar form to the expressions 
already obtained, a greater degree of accuracy being ensured by 
the addition of functions, not of the ordinates themselves, but of 
derivatives of certain of the ordinates. 

The formula may be derived by the expansion of operators, thus : 


2 /(*)=/ (o)+/(l)-|-. ..+/(«-!) 

= F {n) — F (o), where / (a?) is Ai^ (x). 

Since / (*) = AF (x), 

F(x) = A-VW 

= (e®— \)-^ f{x) (since i 4 - A = e®. Chap. VII, para. 6) 

■ D D* 

I +— 7 +— ? + 


r D 


2! 3! 

„ - r D D* ^ . 

= D-^ 1 + ... / (x) 

L 2 12 720 J ' ' 


=D-V(*)-i/(x) + ^/(x)-|^/(x) ... 

= [/(x) dx- J/(x) + ^ — L .... 

j 12 dx 720 <fx* 
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Between limits o and n, we have therefore 


F («)-F (o)=|V W d*-H/(«)-/(o)}+A {/' («)-/' (o)} 

For (w) — F (o) we may write 

2 f(x) or /(o)+/(i)+/( 2 ) + . ..+/(«-!). 

X — 0 

• Jo / («) ^^ =/ (o) +/ ( J ) +/ (2) + • • • +/ (” - I ) + I {/ («) -/ (o)} 
-A- {/' («)-/' (o)} + Tk {/"' (»)-/"' (O)} - 
= i/(o)+/(i)+/(2)+...+/(n-i) + ^/(«) 

{/' («)-/' (o)} + Tk {/"' («)-/'" (O)} -• 


This is a simple form of the Euler-Maclaurin expansion. 

A more general form can be obtained by changing the origin 
to the point a and the unit of measurement to r, thus : 

T fo+nr 

r f {x) = {a)+f {a + r)-^f {a^2r)+ ... 

+/ (a + rT^r) + J/ (a + nr) - ^ {/' {a + nr) -/' (a)} 

+ ~{r {a + nr)-r {a)} .... 


It will be noted that, since 


X 


= I - + B, 


21 




6! ■■■ 


we can express the coefficients in (e®— i)“* in terms of Bemouilli’s 
numbers. As, however, the resulting approximation formula is to be 
used for numerical computation, it is of advantage to give the coefficients 
their actual numerical values. 


9. The following examples are illustrative of the use of the 
formula. 


Example 8. 

r ^ dx 

Evaluate to five places of decimals. 

Jo i+* 
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Choose a convenient unit, say o-i. Then in the Euler-Maclaurin 
expansion we have 

a-o, n—io, r=o-i, and m- = - — . 

® i+x 

dux_ I <Pu_ 6 
dx~ dx^~ 

if^daciii I I III 


1 

— +...+ 

1-3 

I II 

1 ,_ 

1*9 2 2 

1 1 

O'OOI 

720 

I 

6 

|. 

720000* 

^16 


= • 1:0000 — . - -I . 6 . , 

120 4 720000 16 

■90909 

•83333 

•76923 

•71429 

•66667 

•62500 

•58824 

•55556 

•52632 

•25000 

6-93773 

= 6-93773 ~ -00625 -t" 'OOOOI 
= 6-93I49. 

f ^ clx 

— = ■60111; to five places of decimals. 

Jo !+:>: 

This agrees with the true result (log^ 2) to the required degree of 
accuracy. 

Example 4. 

Find the sum of the fourth powers of the first n natural numbers by 
means of the Euler-Maclaurin formula. 

In the formula 


j ra+nr 

U^dx = iUa + Ua+r + ‘ia^Br+ — 

^ Jo 


r 

+ a o) (tt o+nr"** o) •••> 
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put a = o, r=i, Ux = x*\ then 

rn 

x*dx=i* + 2* + 2^ + -- 


+ («- + + [4-3-2«L 


2 12 

since higher differential coefficients of than the fourth will be zero. 

r*® 

.’Sj 


I.e. 


" II I 

= r* n* + — 2411. 

0 r=l 2 12 720 ^ 


r*= «•*+ «'*+ n“ n. 

r.= l 5 2 3 30 

By proceeding on the above lines it can easily be proved that the general 
formula for the sum of the pth powers of the first n natural numbers is 

’■=” I II I 

£ r*’= — /7'’+^ + pn^-^ ft (ft- i) (6-2) n”"® 

r_i p+i 2 12 ^ 720 ^''^ ' 

| />(/»-i)(p- 2)( />-3)(/>-4) 

30,240 


10. Lubbock’s formula. 

The previous formulae have been developed for the purpose of 
relating a definite integral to the sum of a number of weighted 
ordinates at finite distances apart. We have, in effect, obtained 
approximate formulae for the value of 

Lf (“a + “a+A + “a+ZA + ■ ■ • + Mft). 

h — y 0 

In addition to formulae of this type we can find expressions 
which enable us to find the value of the sum of a number of or- 
dinates at finite distances apart in terms of the ordinates at greater 
or less finite intervals apart. Thus, if for the curve y = u^ there are 
m unit intervals, so that we have from a: = o to x — m—i, we 
may develop a relationship between this sum and the sum when 
the intervals are, say, h, where nh=i. 

In place of the sum h (u„ -f- -I- + • • ■ + Ma), we may 

consider, without loss of generality, the simpler form when the 
series commences with Uq. 

Let there be m ordinates at unit distances apart, and let each of 
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these unit distances be divided into n equal parts, so that the new 
ordinates are 

^)> *^l/n> • • • ^rlnt • • ■ l/n‘ 

We have 5 s «,, + “lyn +«^» + ••• + “«-!/„ 

E”^-i A 1 ^ 


By ordinary algebra, putting h for i jn, 

A___ A A 

— I ~ ( I 4- A)* — I “ AA + A(2) A^ + . . . 


I A — I. A^— I.. A^— i.g Vi 

E- A + -Ai^ A*...,>. 

A 2». I2|v 24 Vi 


SO that the operator is 


w-i . n*— I n^-i .. Nt 

n + A • ••• ' 

2»i 1 2 Tir 24 n' 


To obtain S we apply this operator to 


£"•-1 


«0 = «0 + “l + --- 


We note the first difference of this is 


the second difference is A«„ — Amq and so on. 

Thus, applying the operator, 

iS = «0 + Kjy„ + a^/n + . . . + Mm-l/n 

= n (aj + ai + . . . + a„_i) + ^ (a„, - Uo) - (Aa„ - Ako) 


■■■■ 

This is Lubbock’s formula. 

The coefficients of higher differences than the second were not 
given by Lubbock and are cumbersome. The terms in A®a and 
A*a are respectively 

(«*— i) (ign*— i) 

720«* 


(A»a„,-AX) 



woolhouse’s formula 
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and 


48 on=' ' “ 


If the interval of differencing be originally n instead of unity, 
Lubbock’s formula becomes, on changing the unit. 


M0 + M1+U2+ ... +M, 


mn—1 


n—1 


W (Mo'f' “I" ^2/1 *^(m— 1)71) (^mn *^o) 




— I 


T-,« + •••• 

1271 24^ 


*11. Lubbock’s original formula involves Au„^, ... and these 

involve m„+i, ‘-C- values of u beyond the range of sum- 

mation ; these u’s may have to be specially calculated or may even 
be completely unavailable. De Morgan {Differential and Integral 
Calculus, pp. 317-18) and T. B. Sprague (J.I.A. vol. xviii, pp. 
309-10) transformed the formula so as to involve only Uf, ... u„. 
If the numerical values (without sign) of the coefficients of the 
differences in Lubbock’s formula be represented by Q, Cg, ..., the 
transformed formula is 

w («o + «, -t- . . . tt„_i) + ^ («„ -Uo)-C, - A«„) 

- Ci (A“tt „^2 + AX) - C's (A 3 m „_3 - AX) - Q (A %„_4 + AX). • • . 
where all C coefficients are negative and the terms involve alter- 
nately the difference betw'een the terminal odd differences and the 
sum of the terminal even differences. The differences A«„,_j, A^„_2, 
A®m „_8 , . . . are the concluding differences of the scheme based on 
Ufl . . . «m, and lie on a diagonal sloping upwards (Sprague, loc. cit. 
P- 310)- 

12. Woolhouse’s formula. 

Although Lubbock’s formula has the advantage that it may be 
used when the function is not capable of expression as a mathe- 
matical expansion — as for example when the data are based on a 
mortality table — there are disadvantages in adopting the formula. 
In the first place, if it is necessary to proceed further than second 


F U A S 


13 
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differences the calculations are heavy, and secondly it may happen 
that the differences converge slowly, so that if we stop at second 
or third differences we are likely to obtain a result differing con- 
siderably from the true value of the function. In the case of a 
mathematical function whose differential coefficients are easily 
evaluated an alternative summation formula can be adopted in 
which differential coefficients of odd order replace the finite 
differences in Lubbock’s formula. 

The formula involving differential coefficients is due to Wool- 
X house, and may be developed directly from the Euler-Maclaurin 
expansion. 

The Euler-Maclaurin expansion is 

rm j 

U^dx^^Uo + Ui + Ui+...+ - — (uj - Uo) 


If the interval of differencing be i /n, the formula becomes 

rm I 

n I dx = -h i ~ {^rn “ ) 

Jo 

If, however, we multiply both sides of the first expression by n 
we have 


/•m fi 

n u^cbc = n{^Uo+Ui + U2+... + luJ--{uJ-Uo 

JO t2 


rm 

Equating the two values ” J 




^Ug + Uiij^ + U^n'^ ^ 720H^ 

= n {^Uq + Ui + U2+ ... J2 




woolhouse’s formula 


^0 ^lln "f" • • • ■)" i (^0 "I" *^m) j (^rn ^0 ) 


72 on® ' 

■=M(«0 + “l + «2+---+«m)-^(«0 + Wm)-^(“m'-“o') 
Re-arranging : 

“0 + “l/n + «2/7i+--+“m = «K + «l + «2+---+«m) 

«— I . ^ n^—1, , ,x n‘— I , ,,, ,,,. 

(k„, -W o) + :^-^(«m -«o )••■■ 

If the unit of measurement be changed to n, we have 

“O + «1 + “2 + • • • + “mn = n (Mo + M„ + M2« + • • • + “mn) 

«-i , s n*— I . , ,, n*— I , ,,, ,,,, 

^ (*^0 "h *^mn) (*^mn ~ *^0 ) "f ^20~ ~ ) ■ ■ ■ > 

the usual form of Woolhouse’s formula. 

It should be noted that, by replacing the derivatives of u by 
their values in finite differences, Lubbock’s formula can be ob- 
tained directly from the formula above. 

In applying these formulae to certain actuarial functions the values of u, 
... at the end of the mortality table will disappear. Woolhouse’s 
formula may then be written as 


„(«0 + «l/r. + «<2/n+---) 

71 

, . n — \ s’®-! , «*— I 

= (Mo + «!-(-...) «0 + r Uo iMo .... 

2 « i 2 n‘ " j2on* 

This is a convenient form for e,\pressing the value of a benefit paid 
at Kthly intervals in terms of the values at intervals of a year. 


13-2 
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13. Other formulae for approximate integradon. 

It will have been observed that in the formulae of the type of 
Simpson’s, Weddle’s, etc., the function of the u’s is symmetrical 
about the central value. If, however, a number of fixed ordinates 
be given and it is desired to obtain an approximation to the area 
of a curve in terms of these ordinates, the resulting form will 
not necessarily be symmetrical. Again, the formula for the area 
of the curve may be related not only to ordinates falling within 
the area to be measured, but to ordinates outside the area. It 
may be noted therefore that although standard formulae are avail- 
able, it is not difficult to devise appro.ximations to fit the particular 
problems under investigation. (See paragraph 15.) 


Example 5. 

If Uj, is of the form a-\-bx + cx\ find a formula for u^dx in terms 

Jo 

of ttfl, ttj and Uj. 

The interpolation formula which involves the terms u^, Uj and is 
“* = *^ + *■^“0+ (»- 1) A*j/o- 


u-dx = 




= «0+H“l-“o)-r2 + 


— A “0 + “r 12 
The required formula is therefore 

[ = (5«o + 8«,-Mj.). 

Jo 

It should be noted that (i) the expression is unsymmetrical in the u’s, 
and (ii) the ordinate u, falls without the area to be integrated. 


Example 6. 

If is of the same form as in the example above, derive a formula 

for u^dx in terms of u_i, «, and m*. 

J 0 

Let K, be expressed in the divided difference formula, thus : 

«, = M_i + (x + 1) A«_i + (^ + 1) (* - 1) 

1 1,3 



ALTERNATIVE METHODS 
*•* ' /*» 


Then 


= «-i + 1 A«-i - 1 A*m_i ; 

1 1/2 


A«-i= , 

1 2 


A*«-i = 
1.2 


2 ’ 


|/{2-(-l)}. 


I 2 

On reduction, the integral is easily found to be 
w(S“-i + 39 “i- 8 «s). 
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It is obviously impossible to quote all the formulae that are in 
current use. Further examples and illustrations of various approxi- 
mation integration formulae will be found in the following sources : 
Whittaker and Robinson’s Calculus of Observations, Chap, vii; 
C. H. Wickens, J.l.A. vol. Liv, pp. 209-13; A. E. King, T.F.A. 
vol. IX, pp. 218-31; Elderton’s Frequency Curves and Correlation, 
3rd ed., pp. 26-8; J. Buchanan, J.I.A, vol. xxxvii, p. 384. 


14. Alternative methods of proof of the formulae. 

It has been stated above that the Euler-Maclaurin expansion can 
be used as the basic quadrature formula. It will be instructive 
to develop Simpson’s formula from this expansion. 


r2T)m ^2 

U^dx = M + «„ -I- + . . . + - — (u^,, - «„') 

Jo 12 


approximately. 

Writing 2/1 for n, but preserving the same range o to zmn, ' 

^2mn 

J U^dx ZTl ( i Wq "t" Mgtj, ^4n ^f2m— 2)n ^^2mn) 

j 2 2mn ^0 )' 

Subtracting this from four times the first : 
r2mn 

3 J 0 u^dx = 2n{^Uo + 2u„ + u^ + 2u^„ + ... + 

+ 2«(8m-l)n + i“2mn) ; 
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i.e. 


r2mn fi 

U^dx = - («o + 4«« + 2«2n + 4“3n + • • • + 2«(2m-2)„ 

Jo 3 

4^2m— l)n ^7nn)» 

which is the extended Simpson’s formula. 


15. Proofs by Lagrange’s formula. 

We have shown in paragraph 2 (p. 176) that we may adopt the 
expression for in terms of «(, and differences of Mq as the assumed 
form of function. Since Lagrange’s formula is based on the same 
assumption it is evident that, given selected ordinates, at either 
equal or unequal intervals, we can obtain any approximate formula 
by the use of the Lagrange formula. 

For example, given Uj, u^. 


(x-i)(x — 2) jcfx— 2) i) 

"z = “0 - + "1 — 7 — V + «2 - 

(-l)(-2) I.(-l) 2.1 

= lUf) {x^ - 3X + 2 ) + (23C - (x** - x), 


so that 


= J (Mo + 4Uj + tt2). which is Simpson’s rule. 


If we integrate between o and i we shall obtain 




= A (5«o + 8 «I - Wz), 


the formula given in Example 5. 

By this method it is easy and straightforward to deduce special 
formulae for particular cases. 


*16. Remainder terms. 

It is known (Chap. Ill, paragraph 17) that if an interpolation 
formula is used to find from n values u„, Mj, ... m*, the error 
involved, or remainder term, is of the form 

d” 
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where ^ is an unknown number falling in the range which includes 
X, a, b ... k. By means of this result, expressions in similar form 
have been found for the errors or remainder terms of the principal 
quadrature formulae. 

For example, if the range of integration is a to b, the remainder 
term in Simpson’s formula is 

- 00035 (i-af — 
and that of the three-eighths rule is 

d* 

-•oooi6(6-flf 

We cannot usually find but if the ma.ximum value of the 
differential coefficient can be found the expression gives an upper 
limit for the error. 

For further information on this point the student is referred to 
Steffensen, Interpolation, Sections 12-16, and Milne-Thomson, 
Calculus of Finite Differences, Chap. vii. 


EXAMPLES 9 

1. Prove that 

r 2a ra 

u^dx=\ {u^->ru^f)dx, 

Jo Jo 

and illustrate the result geometrically. 

2. Jf u^ = a + bx + cx'^, prove that 

j^U,,dx = 2 Ui+ 1^2 (tto - 2M8 + M4), 

and hence find an approximate value for 

f* 

j e 

3. Show that f = (5«i-l-8tto — «i_j) approximately. 

Jo 

Find the approximate mileage travelled between 12.0 and 12.30 by 
use of the above formula, from the following: 
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finite differences 


Time 

Speed (m.p.h.) 

11.50 

24-2 

12.0 

35-0 

12.10 

41-3 

12.20 

42-8 

12.30 

39-2 


4. Prove that, if is a rational integral function of *, then 

{ X ^ 

e® u„dx = ae°^ {i — aZ) + a®Z)“-a®Z)®+ ...) 


wher« D=-r. 

dx 


5. Show that the area of a curve, divided into n parts by « + i equi- 
distant ordinates u^, Wj, ... is given approximately by the series 


nu^ + - 


2 1 1.2 


(n* , 

n^ + n 

u 


4 

] 1 - 2.3 


to n + 1 terms. 


6 . Between the limits * = o and x = n the functions and dujdx 
are continuously increasing. 

rn n-1 

Show that Ugdx is less than i^o-f S Ux + iu„. 

Jo 1 

7. Obtain the approximate formula 

J-i * 12 ’ 

showing up to what order of differences it holds. 

8. Assuming to be of the fourth degree in x, express [ in 

j 0 

terms of Ug, Kj, Uj, Uj and u^. 

9. A plane area is bounded by a curve, the axis of x, and two ordinates. 

The area is divided into five figures by equidistant ordinates 2 inches 
apart, the lengths of the ordinates being 21-65, 21-04, 20-35, 19-61, 
18-75 17-8^0 inches respectively. Apply the method of integration 

to obtain an approximate value of the area. 

10. Prove the approximate formula , 

rlO 

= 2-5 («1 + + Hg + Mg), 

Jo 

and show that the formula involves a small second difference error. 
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f8 

II. Find the value of (i +*:)“* die. 

Jo 

Obtain approximations to the value by applying 

(i) Weddle’s rule: 

f® 

u^dx = 0*3 (wo + SWi + + 6M34- M4 + 5W5 + Me)» 

JO 

(ii) Simpson’s rule; 


(•2 

dx = i («o + 4Mj + iij), applied three times. 

J 0 

12. Which of the two following formulae would you expect to give 
the better approximation for u^dx} 


(a) 0- {5 (“0 + “t) + 4 («i + th)+ i 8 uj, 
W lT{2(“0+“2 + “4)+I2(tti + as)}- 

13. Prove the approximate integration formula: 
rn 

= « {guo + 2 V (l9“n - 5%. + «»n)}. 
Jo 

and hence find [ u,.dx given the following table: 


X o 1 23 

«* 27.650 31.252 35,154 39,368 

14. Prove that 

I / (*) J* = H/ ( - i ) +/ (i)} + ^ (A/ ( - i ) - V (.i)} approximately. 

,-3 

Hence find /(x)dx, when /(o)=io5, /{i) = 2i2, /(2) = 42i, 
' 1 

/(3) = 749 and/ (4) =1050. 

15. If either increases continually or decreases continually as x 

rn u- 1 

increases, prove that u^dx differs from S «j, by less than the differ- 
Ji 1 

ence between Kj and u„. 

Prove that the difference between log n and 

i+i + J + ... + r/(»-i)<i, 
however great n may be. 
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1 6. If third differences are constant, prove that 

u^dx = (tt_j + 23 ttj + 23 ttj + «j). 

Adapt this formula to find the approximate value of log, 2 from the 

, r 

integral — . 

j a ^ 

17. Prove that, approximately, 

j_ = °'3 + 4-4“-2 + 2-5«^i + 4“o + 2‘S“i + 4‘4“2 + 1 ' iWs)- 


18. /(x) is a rational integral function of the fifth degree in x. 
Prove that 

j (^) dx = 1/(0 ) +1 {/ (Vo-6) +/ ( - V 0-6)). 


19. Use Simpson’s rule to prove that log, 7 is approximately 1-95. 

20. Apply the Euler-Maclaurin formula to find a formula for the sum 
of the fifth powers of the first n natural numbers. 

21. Obtain Shovelton's integration formula: 


flO 

u^dx = yf e {8 («o + w,o) + 35 («i + “-j + «<7 + “ 9 ) 

Jo 

+ 15 (’fi: + “r + «9 + “s) + 36«6}- 

22. By means of Hardy’s formula 

f6 

Uxdx= -28119 + 1-62K1 + 2-2U3 + I -62115 + -28115, 

J 0 

calculate the value of 

Jo 

correct to four places of decimals. 


23. If / (x) be a function of the third degree in x, and if 



24. AB is the base of a semicircle, centre O and radius unity. The 
points P and Q bisect OA and OB respectively. The area between the 
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TT ^ 

semicircle, the base PQ and the ordinates at P and <3 is ^ + - . Use 


Weddle’s rule that 

[ f{x)dx^o-j, If (o) + 5/ ( I ) +/ (2) + 6/ (3) +/ (4) + 5/ (5) +/ (6)} 
Jo 

to find an approximate value of ti to three places of decimals. 

25. The following values of are given: 


*0123456 
i/j. -146 -161 -176 -190 -204 -217 -230 

f® 

Use an approximate integration formula to find the value of u^dx. 

Jo 

It is found that, for the values given, 3» = logjo (-05*+ 1-4) fits the 
data. Verify that this is so by integrating logjo (-05*+ 1-4) between the 
limits o and 6. (log,n e = -4343 ; log,,, i-7 = -2304; logjo i'4 = -i46i.) 

I'l 

26. If Mj. is a function whose fifth differences are constant, u^dx 
can be expressed in the form 

pu_^ + gUo+pUa. 

Find the values of p, q and a. 

Use this formula, after making the necessary changes in the origin 
and scale, to find the value of log* 2 to four placeS of decimals from the 
equation j. 1 ^ 

dx = \o%,2. 

Joi+x 


27. Prove that, if a = o, 

ra+r 

u^dx-iu^+u^ + u^+ ... (‘^“r-l-^«o) 

- + A^) - ,^*5 (A»M,_g - AX) 

-rotr(AX-. + AX).... 

If be the function (i find an approximate value for tt. 

r 3 

28. Obtain an approximate formula for u^dx in the form 

.U 3 

a (M_, + tt2) + *(«-a + U8) 
and find the values of a and b. 

29. Use Lagrange’s formula to show that 

l5 1 78J0 + 308 (jv_i -t-jj) - 17 (jy_2 d-jj)]. 



PROBABILITY AND ELEMENTARY 
STATISTICS 

CHAPTER X 

PROBABILITY 

1 - Suppose that a bag contains a hundred balls of exactly the 
same size and shape, of which one is coloured and the remaining 
ninety-nine white. If one of these balls be drawn at random it is 
safe to say that there is a greater probability of drawing a white 
ball than of drawing the coloured ball. Again, of a number of men 
in a community, all subject to the same conditions, the probability 
that one aged 20 will survive 10 years is obviously greater than the 
probability that one aged 90 will survive the same period. These 
examples serve to illustrate what is understood by the term “prob- 
ability”. Many more such simple examples could be given, from 
which we could say that the probability that event A would happen 
is greater or less than that event B would happen. A difficulty 
arises, however, when we attempt to assign a measure to the 
probability that one of the events A or B might happen. 

In the first of the above examples it is not unreasonable to argue 
that since there is one coloured ball among a hundred balls, the 
probability that this coloured ball would be drawn is one in a 
hundred. Whether this be so or not, we have used the data at our 
command and have given a measure to the probability. In the 
second example, however, we have no data immediately available 
to enable us to assign a numerical value to the probability in 
question. It would be necessary to collect statistics of the mortality 
of the men in the community, and to supplement the collection 
of these statistics by more or less elaborate calculations. In the 
application of the theory of probability to most actuarial problems 
the aggregation of statistics is necessary before the data are avail- 
able, and it is not often that we are immediately in possession of 
such simple facts as are given in the first type of question. A com- 
plete answer to a problem in actuarial work would involve the 
collection and interpretation of the relevant statistics, and in many 
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of the questions on probability that are dealt with later it will be 
assumed that sufficient data are available to enable us to proceed 
to the required conclusions. 

2. In the study of probability we are concerned with the number 
of times that an event occurs on the average out of a very large 
number of occasions on which it is in question. The proportion of 
such favourable cases to the total number of cases may then be said 
to be the “probability” of the happening of the event. For example, 
consider the following extract from a mortality table; 


Age 

Number of persons 

who are alive at exact 


age * 

0 

1000 

10 

8 i2 

20 

793 

30 

762 


This table may be interpreted thus. Out of 1000 births, 812 
children survive to age 10; of these 793 survive to age 20; of these 
762 survive to age 30; and so on. Since “living to age 10” is an 
event, and we start with 1000 births, this event is in question on 
1000 occasions; the proportion of times that this event happens is 
812/1000 or -812. 

•812 is therefore the “probability” that, according to the par- 
ticular mortality table given above, a child just bom will survive to 
age 10. Similarly, the probability that the child will survive to 
age 20 is ’793 and that it will survive to age 30 is -762. 

These probabilities are all averages, and being averages require 
adequate data for their calculation. In ordinary circumstances little 
can be argued from an event which happens on one occasion only; 
in probability nothing of value can be deduced from solitary in- 
^stances. In fact, probability as applied to a solitary occasion rarely 
has any meaning ; not only can few deductions be made from solitary 
instances, but in general they cannot be applied to future solitary 
oj^casions even if they have been drawn from sufficient data. 
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The collection and interpretation of the data necessary for the 
calculation of averages and probabilities is the work of the statis- 
tician, but it is the mathematician who is concerned with the laws 
followed by these averages and probabilities. It is worthy of note 
that although probability and statistics are in effect inseparable, the 
theory of probability was evolved from a very different source — 
namely, a consideration of gambling and games of chance. The 
association between the two sciences came at a much later date. 

3. It will be observed that the term “probability” has been em- 
ployed in two senses. It has been used as the name of the subject 
itself, and also as the name of a ratio — the proportion of the number 
of favourable events to the total number of cases when the hap- 
pening of an event is in question. There is yet a third sense in which 
the word is frequently employed, namely in the sense of “credi- 
bility”. Such terms as “credibility” and “likelihood” in their 
ordinary colloquial meaning are incapable of exact mathematical 
measurement and with probability in this sense mathematicians are 
not concerned. 

In this and the following chapters the term “probability” will 
be used in the sense of an average, but in so doing it must be 
repeated that the study of the subject will be of no assistance in 
forecasting the results of solitary events. 

4. When the cases which constitute the totality of possibilities 
for any event are counted, care must be taken that all the cases are 
strictly comparable. If we are considering the probability of living 
a number of years we must be careful to state whether we are 
dealing with male lives only or with female lives only or with an 
aggregation of both sexes; for the longevity of males is different 
from that of females. Similarly, if it be required to find the prob- 
ability that a coin of particular value is drawn from a bag containing 
a number of coins it must be stated at the outset whether or not all 
coins are equally likely to be drawn. If one of the coins were a 
half-crown and the others were all sixpences, it could hardly be 
said that it would be equally likely that any one particular coin 
would be drawn; the cases would not be homogeneous. 
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This important question of homogeneity arose quite early in the 
history of the subject. Errors were often made in the solution of 
what are now recognized as quite simple problems through the 
neglect of this principle. D’Alembert’s solution to the following 
problem is classic: 

What is the chance that heads will turn up at least once in two 
tossings of a coin? 

The solution given was as follows; 

“Only three different events are possible, namely, 

(i) heads the first time — which makes it unnecessary to 
toss again; 

(ii) tails the first time and heads the second ; 

(iii) tails both times. 

'Fhere are thus two favourable cases out of three, and the 

required probability is therefore 

The three cases of d’Alembert are not equally probable. The 
classification is wrong. This should be 



First toss 

Second toss 

(<») 

Head 

Head 

{b) 

Head 

Tail 

(c) 

Tail 

Head 

id) 

Tail 

Tail 


Of these four cases (a), (b) and (c) are favourable to the event in 
question, namely, “heads at least once’’, and the required chance is 
therefore f. 

5. At first sight it would seem that the examples above have been 
drawn from different sorts of experience. In paragraph 2 a result 
has been obtained from a consideration of a mortality table — ^which 
is a matter of records — while in D’Alembert’s problem we used an 
illustration based on the tossing of a coin — a matter of presumption 
and not of records. The difference between these two kinds of 
phenomena is real, but it does not render them incompatible in 
the study of probability, nor does it destroy the general idea of the 
average value. When we speak of tossing a coin we do not mean the 
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ordinary coin of commerce, but an ideal coin, which is a perfectly 
uniform circular disc with its centre of gravity at the centre of the 
coin. The same ideal conditions will be assumed to be present in all 
subsequent experiments with dice and cards. Thus a die will be a 
perfect cube and in a pack of cards each card will be assumed to be 
drawn as often as any other card in a large number of drawings. 
On these assumptions it is unnecessary to refer to records ; we base 
our arguments on the definitions of the objects themselves. The 
illustrative examples that can be drawn from a consideration of the 
ideal objects are most suitable for demonstrations of the principles 
of probability ; examples obtained from statistical records generally 
involve large and unwieldy figures. 


€. Definitions of the probability of an event. 

(а) The “unitary” or “a priori” definition. 

If an event can happen in a ways and fail to happen in b ways, 
all these ways being equally likely and such that not more than one 
of them can occur, then the probability of the event happening is 
a/(a + b). 

If the probability that an event £ will happen be denoted by p, 
then p = al(a + b); this is called the “a priori” definition of prob- 
ability because we have reasoned from general considerations and 
not from statistical observations. Since, however, we have inserted 
the proviso that all the ways of happening are equally likely — or, 
to put it in another way, if the experiment were continued a very 
large number of times the different cases would occur with equal 
frequency — then we may say that p = a/(a -t- b) is the proportion or 
average of the number of times that the event E will happen. 

(б) The “statistical” definition. 

If on taking any very large number N out of a series of cases in 
which an event E is in question, E happens on pN occasions, the 
probability of the event E is said to be p. 

This definition is of greater generality than the “a priori” 
definition; it gives the practical interpretation that the probability 
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of an event is a relative frequency. Further, it suggests that the 
probability arises from statistical investigation. 

The meaning of the statistical definition has already been ex- 
emplified by the probabilities of survival in paragraph 2. Further 
illustrations will be given later by reference to a correlation table. 

7. From the definitions above we may immediately deduce the 
following facts : 

(1) The probability of an event is a ratio. 

(2) The ratio can never exceed unity. 

The measure of certainty is unity. If an event is certain 
to happen the probability of its happening is r; if it is 
certain not to happen the probability is o. 

(3) The sum of the probabilities of the happening and of the 

failing of an event is unity. 

(4) If the probability that an event will happen is known, the 

probability of its failing is also known. 

Instead of using the fraction aj{a + b) for the probability of an 
event, we may use an alternative form: if p — al(a + b) and, there- 
fore, I —p — bl{a + b), then the odds in favour of the event are said 
to be a : b. 

8 . The simplest problems in probability are those in which both 
the number of favourable ways and the total number of ways in 
which the event may happen can be counted, either arithmetically 
or by the aid of elementary rules. The enumeration is often best 
performed by the application of the theorems of permutations and 
combinations, and a thorough knowledge of this branch of algebra 
is essential for the speedy solution of many of these questions. 

Example 1. 

An ordinary pack of cards contains 13 cards of each of four suits: 
spades, hearts, diamonds and clubs. Ten cards of each suit are numbered 
I (the ace), 2,3 ... 10, the remaining three being “ court cards ”, namely, 
knave, queen, king. 


F M A t 


»4 
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Find the chance that if a card be drawn at random from an ordinary 
pack it will be a heart. 

Since there are 52 cards in a pack, the number of ways in which a 
card can be drawn (i.e. the total number of ways in which the event can 
happen) is 52. 

The number of ways favourable to the event is 13, since there are 
13 hearts in the pack. 

The required probability is therefore ^ or 

Notes, (i) The above result means that over a long series of trials the 
proportion of favourable draws would approximate to one-fourth of the 
whole number of trials. It is convenient, however, to think in terms of a 
single trial, although the results of our calculations merely epitomize the 
experience of a long series of trials. 

(ii) The word “chance” may be taken to be the same as the word 
“probability”. There is a shade of difference between the two ideas, 
“probability ” referring to past, present and future events, and “ chance ” 
to future events only. This distinction can be ignored. 


Example 2. 


Three cards are drawn at random from an ordinary pack. Find the 
chance that they are a king, a queen and a knave. 


The total number of different ways in which three cards can be drawn 
is and all these ways are equally likely. 

The number of ways in which a king, a queen and a knave can be 
drawn together is 4®. 


4^. 3 ! 

The required chance is therefore — — or 

52.5I-50 


16 

5525' 


Example 3, 

Out of 2« + 1 tickets consecutively numbered, three are drawn at 
random. Find the chance that the numbers on them are in arithmetical 
progression. 

This problem can be solved by virtually writing down the sets of 
numbers that can be in a.p. Consider, for example, the sets of three 
numbers in a.p. beginning with 4: they will be 

4;S; 6 4; 6;8 4 ; 7 ; 10 ... 4 ; n + i ; 2n-2 4:» + 2;2w 

where the highest number that can enter into any set is zn. There will 
evidently be n — 2 such sets of numbers (being the number of numbers 
from 5 to « + 2, both numbers inclusive). We can therefore write down 
the following schedule; 
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Lowest number of the three 

1 

2 

3 

4 

5 

6 


Number of favourable ways 
n 

n—1 
n — i 
n-z 
n — z 
«-3 


zn — z I 

2« — I I 


The total number of favourable wa5rs is gj 


1 since 


the total number of ways of drawing three tickets is ^ ^ 


it is easily seen that the required probability is — . 

Example 4. 

If the letters of the word REGULATIONS be arranged at random, 
what is the chance that there will be exactly four letters between the R 
and the E? 

The conditions are satisfied if 


the R is 

in the first place and the E 

in the sixth, 

„ R 

,, second „ 

E 

,, seventh, 

„ R 

„ third „ 

E 

„ eighth. 

R 

„ fourth „ 

E 

„ ninth, 

„ R 

„ fifth 

E 

„ tenth. 

R 

„ sixth 

E 

„ eleventh. 


or if the R and the E are transjiosed in any of the above. 

Now there are n letters in the word; therefore the letters R and E 
can jointly occupy 1 1 x lo positions. Since there are 6x2 favourable 
positions (as above) the chance that the R and the E are in the favourable 
position is or 


Example 5. 

Find the probability that the number 8 will be thrown in a single 
throw with two dice. 

The total number of numbers that can be thrown with two dice is 
36, since each of the 6 ways of throwing the first die can be associated 
with each of the 6 ways of throwing the second. 


14-2 
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The number of ways favourable to the event can be found by simple 
enumeration, thus; 

First die Second die 

2 6 

3 5 

4 4 

5 3 

6 2 
Total: 5 ways. 

The required probability is therefore ^ . 

9. Let us consider the simple question ; if a die be thrown, what is 
the chance that an ace appears.^ 

There are 6 equally likely cases, one of which is favourable to the 
event. Therefore, by the unitary definition, the required chance 
is J. 

Again, by the statistical definition, we may say that since each 
face is as likely to appear as any other, in a very large number of 
throws (N) the ace will appear Njb times ; the required chance is 
therefore as before. 

Whichever definition is employed it must be borne in mind that 
the probability J denotes the proportion of times that the ace would 
appear out of a very large number of throws, and that we are quite 
unable to predict on which occasions in a series of trials the ace 
will be presented. This leads us to examine more fully the meaning 
of the word “single” in such a question as “what is the probability 
of throwing an ace in a single throw with an ordinary die ? ” This 
point can best be explained by reference to an illustration similar 
to the first illustration in paragraph i. 

If there be five exactly similar baUs in a bag, two coloured and 
three white, then by the unitary definition the probability that a 
coloured ball will be drawn (supposing one ball be drawn at 
random) is f. This does not mean that if a ball were drawn five 
times, being replaced each time, two of the draws would necessarily 
give coloured balls. The true interpretation of the fraction f is that 
if the experiment were repeated a large number of times a coloured 
ball would be drawn on the average twice in five times. 



EXAMPLES BASED ON A CORRELATION TABLE 213 

The meaning to be attached to the word “single” in the original 
question is that the records are to be per sets of one throw. 

In a similar manner, we may define the words “the next throw”. 
Suppose that it is required to find the chance that in two consecutive 
throws of a single die the numbers 2 and 3 will appear. The word 
“next” refers to a specific order, and what we set out to find is 
the proportion of times that the order 2, 3 appears in a very large 
number of trials. 


10. It has been remarked that illustrative examples in probability 
can most easily be given from a consideration of ideal objects, such 
as coins, dice, cards, etc. Examples from statistical records involve 
the collection and tabulation of statistics, and, moreover, often 
necessitate lengthy calculations. It is instructive, however, to con- 
sider the results that can be obtained from certain statistical records, 
and the following examples, based on what is known as a correlation 
table, are of interest. A correlation table may be regarded as a 
table giving the values of / (a;, y), where x and y are two variables 
which it is known or suspected follow some kind of law of variation. 
In the table on p. 214 the a; variable is the age of husband, while the 
y variable is the age of his wife. The data are taken from the Census 
of 1921 and give to the nearest thousand the numbers of husbands 
in the various age-groups shown down the side with wives of the 
ages in the groups shown along the top. Thus, the figure 86 along 
the line (45-) and in column (50-) means that there were, to the 
nearest thousand, 86,000 husbands aged between 45 and 50 at the 
time when the Census was taken who had wives aged between 50 
and 55. It should be noted that quinquennial age-groups have been 
taken in constructing the table, although more accurate results 
would be obtained by the use of individual ages. Apart from other 
considerations, it would, however, be impracticable to give for the 
present purpose a table at individual ages, and the results obtained 
below may be taken as sufficiently accurate. We may make the con- 
venient assumption that each husband has the age represented by 
the mid-interval value, and similarly for each wife. In other words. 



1921 Census 

Number of married couples for quinquermial age-groups of husband and wife (looo’s) 
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the number 86 may be read as indicating that there are 86,000 
couples of exact age 47J for the husband and 52^ for the wife. 
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It should be noted that the values in the table are termed /re^uen- 
cies, since they show how often that particular conjunction of ages 
occurred in the tabulation of the data. (See Chap. XI, paragraph 2.) 

We will now apply the figures from the table to answer some 
typical questions involving probabilities and averages. 

Note. The results will not be 'affected if the figures are read as they 
stand. In effect the values are all divided by 1000. 


Example 6, 

Find the probability that a husband aged 40-45 will have a wife 

(i) under 40 years of age, 

(ii) over 45 years of age, 

(iii) between 40 and 45 years of age. 

(i) We have to find p such that 

pN — numhtr of husbands aged 40-45 having wives under age 40, 
.V = total number of husbands aged 40-45. 

4 + 22 + 84 + 201 401 . 

(ii) As above, 

= number of husbands aged 40-45 having wives over age 45, 
A' = total number of husbands aged 40-45. 


p=pNIN= 


99+IS+3 + I 118 


964 


= -^ = •122. 


964 


(iii) 

pA^= number of husbands aged 40-45 having wives aged 40-45, 

A^ = total number of husbands aged 40-45. 

Note. The sum of the three probabilities obtained = = * ■ 

This is as it should be, for the three cases exhaust all those in which the 
husband is in the age-group 40-45. 


Example 7. 

Find the probability that a wife aged over 50 will have a husband aged 
less than 50. 
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If p is the required probability, then 

= number of wives over age 50 having husbands less than age 50, 
A^= total number of wives over age 50. 

P = pN N=-^ 2 

^ 676 + 497 + 343 + 215 + 111 +44+ 12 + 2 1900 

= -067. 

Example 8. 

In a sample of 10,000 husbands between ages 30 and 50, how many 
might be expected to have wives less than 40 years old? 

We require io,ooop, 

where N = number of husbands in the table between ages 30 and 50, 

/>A^= number of husbands between 30 and 50 having wives less 
than 40 years old. 


io,ooop=io,ooopNIN 

_ 1+64 + 390 + 864 + 9 53 
906 + 977 + 964 + 916 


X 10,000 


22,720,000 

3763 


= 6038. 


11. Mutually exclusive events. 

If an event can happen in more than one way, all ways being 
mutually exclusive, the probability of its happening at all is the sum 
of the probabilities of its happening in the several ways. 

This proposition is sometimes known as the addition rule, and 
is fundamental in the application of the theory of probability. The 
rule may otherwise be stated thus ; 

If pi, Pj, Pa, ... Pn are the probabilities of n mutually exclusive 
events, the probability that one of these events happens on any 
occasion in which the n events are in question is 

Pi+Pa+P 3 + ... +p„. 

The words “mutually exclusive” in this proposition are of the 
utmost importance. A proof of the rule depending on the statistical 
definition of probability will show that unless the events are 



MUTUALLY EXCLUSIVE EVENTS 


217 

mutually exclusive we cannot add the simple probabilities. Thus, 
if there be a large number N out of a series in which the n events 
are in question, the first will happen on p^N occasions, the second 
on occasions, the third on p^N occasions, and so on. Since the 
events are mutually exclusive, one and only one of the events can 
occur on any one occasion. Therefore, out of N occasions, one or 
other of the events will happen on p^N+p^+p^N+ ... +pJSl 
occasions. The probability that one of the events happens is there- 
fore {piN+p^ +pj^ + ... +pnN)lN or />i+/>2+/>8+ ... +/)„. 

It will be seen that this proof breaks down and the proposition 
is not true if any one event happens on any one of the occasions 
that any other happens, i.e. if the events are not mutually exclusive. 

It follows easily that if instead of keeping in view separate events 
we consider the various ways in which a single event may happen, 
the probability of the happening of the event is the sum of the 
probabilities of the mutually exclusive ways in which the event 
may happen. ¥or, if there he fi dtHerent ways in which the event 
may happen and p^ is the probability that the event happens in 
the rth out of the n different ways, then if the event must happen, 

f»n 

T, p^N is the total number of occasions on which it happens in 

r — 1 

N trials. Since the event always happens, I,prIV=,\\ so that 

I.pr=l. 

The whole number of ways in which an event may happen is 
sometimes called the universe of the event. If an event may fail, 
the failing of the event is also included in the universe. 

The following example illustrates the principle outlined above. 

Example 9. 

What is the chance of throwing a total of 3 or 5 or ii with two dice? 
The three events, i.e. throwing 3, throwing 5 and tlirowing ii, are 
mutually exclusive, since the occurrence of any one of them precludes 
the occurrence of any other. 

We require the proportion of times out of a large number of trials in 
which one or other of the three events will happen. 

Let there be a large number of trials .V. Then since the chances of 
throwing 3, 5, ii are ^ respectively, totals of 3, 5, ii will be 
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obtained on -^N, -^N, occasions respectively. The total number ol 
times on which one or other of the events will happen is therefore 

3 %N+:^N+^Nor fiV. 

Hence, by definition, the probability of throwing a total of 3 or 5 or 1 1 
is f , which is the sum of the separate probabilities of the three mutually 
exclusive events. 

In practice it is sufficient to apply the addition rule at once and to 
write the solution as + + ^ = 


12. Compound probability: independent events. 

If there be a series of events such that the happening of any one 
of them in no way affects the happening of any other of them, the 
events are said to be independent. 

The probability that two independent events happen on any one 
occasion on which they are both in question is the product of the 
chances of their happening severally. 

For consider two independent events E and £', the probabilities 
of which are p and p' respectively. Out of a large number N in 
which the events are in question E w'ill happen on pN occasions, 
and out of these pN occasions E' will happen on p' {pN) or pp' N 
occasions. The probability that they both happen is therefore pp' . 

Similarly, it may be shown that if there are n independent events 
the respective probabilities of which are pi , />» > />3 > • • •/’«) the 
probability of the joint happening of them all is pip-^pz Pn- 

We may use the a priori or unitary definition of probability to 
prove this proposition. 

Let there be two independent events the probabilities of which 
are p and p' and let p = aj{a + b), p' = a' j{a +b'), where a and b 
are respectively the numbers of favourable and unfavourable 
equally likely ways in which the first event can happen; and 
similarly for a' and b' with regard to the second event. 

Then there are {a + b) {a + b') equally likely ways in which the 
two events are jointly in question. Of this number there are aa' 
equally likely ways favourable to the joint happenings of the two 
events. 
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Hence, by the unitary definition the probability of the joint 
event is 


aa' 

{aTb)~{V+V) 


a a , 

a-\-b'a'^-b' 


= P, 


where P is the required probability of the joint event. 

In a similar manner we may prove the proposition for n inde- 
pendent events. 

This rule is known as the multiplication rule, and its application 
to the probabilities of happening or otherwise of two independent 
events may be seen from a consideration of the following schedule. 

Let the probabilities of happening of two independent events be 
p and p' respectively. Then 

(i) the chance that they both happen =/>/>'; 

(ii) the chance that the first happens and the second fails 

=p(i-p'): 

(iii) the chance that the second happens and the first fails 

=P' 

(iv) the chance that they both fail = ( i -p) ( i -/>'). 

The total of these chances 


=/»/>' +P (i -P')+P' (I -/>) + (! -P) (I -P') 

— pp' +p —pp' -j-p' -pp' + 1 -p-p’ + pp' 

= 1 , 

which is obviously true, since we have exhausted all the possibilities. 


13. The application of the principles enunciated above can best 
be appreciated by reference to actual examples. 

Example lo. 

Find the chance that (i) three heads, (ii) two heads and one tail will 
turn up in three successive spins of a coin. 

(i) The chance of a head at the first spin = J 
» „ second „ 

,, ,, third ,, — ^ 

Combined chance = (^)® = 



220 


PROBABILITY 


(ii) The chance of a tail is the same as that of a head, so that the 
total chance might appear to be (J)* as before. This is in fact the chance 
that 2 heads and i tail occur in a specified order, say, HTH. The con- 
ditions of the question would, however, be satisfied if the order were 
HHT or THH. These three favourable events are mutually exclusive. 
The chance that the two heads and tail are obtained in one of the three 
orders is found by adding the three probabilities. The required chance 
is therefore 3 x ( = |. 

Example 11 . 

Q is the probability that a man aged x will die in a year. Find the 
probability that out of 5 men. A, B, C, D, E, each aged x, A will die in 
the year and be the first to die. 

The chance that a given man dies in the year = Q. 

The chance that a given man does not die in the year =; i - 0 . 

The chance that none of the five dies in the year =(i — Q)®. 

The chance that at least one man dies in the year = i — (i — Q)^. 

Since the chance that A is the first to die is obviously and since 
this is independent of the chance that at least one man dies in the year, 
the required chance = J { i - (i — Qf}. 

Example 12. 

A throws 3 coins and B throws 2 coins. Find the chance that A will 
throw a greater number of heads than B. 

(a) The chance that A throws 3 heads is (^)^. The chance in this 
case is (i)® x i, since B must obviously throw less than A. 

ip) The chance that A throws 2 heads and i tail is 3 (JP {^), for he 
may throw HHT, HTH or THH. For B to throw less he 
must not throw 2 heads; the chance of this is i — (I)'*. 

The chance in this case is 3 [i — (i)']- 

(c) The chance that A throws i head and 2 tails is 3 (J) 

In order that B may throw less he must throw 2 tails, the 
chance of which is (4)*. 

The chance in this case is 3 (^)^. 

The three contingencies {a), (b) and (<r) exhaust all the cases for the 
required event and they are mutually exclusive. 

The result is therefore (i)*-f-3 (J)® [1 -(i)®] +3 (i)® ot. on simplifica- 
tion, J. 
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14. Compound probability: dependent events. 

If two or more events are not independent we may still apply the 
multiplication rule. Thus, if p be the chance of happening of an 
event, and p' the chance of happening of a second when the first 
has happened, then the chance that they both should happen ispp'. 

We may prove the proposition in the following manner. 

Let there be two events E and E' which are to happen in that 
order. Let the probability of E be p and the probability of E' after 
E has happened be p' . 

Then if the double event is in question on a very large number 
of occasions W, the number of times that E happens is pN — which 
may also be assumed to be large. Out of these pN occasions E' 
happens on p' (pN) occasions. Therefore out of the total occasions, 
N, there are pp'N occasions on which the event E' follows the 
event E. It follows that the probability of the double event ispp'. 

By similar reasoning we may show that if there be n events 
El, E2, ■■■ E,^ such that the probability of the (r+ i)th event after 
the previous specified r consecutive events have happened is />r+j, 
then the probability of the happening of the n events is given by 
the product p„. 

Care must be taken to distinguish between independent and 
dependent events. In considering independent events the hap- 
pening of either event does not affect the happening of the other, 
and to find the probability that they both happen we multiply the 
simple probabilities of each event. Where the events are not inde- 
pendent, we must find the chance p2 that the second happens when 
the first has happened before we can apply the multiplication rule. 
For example, if there be two urns, one containing one white and 
two coloured balls, and the other one white and three coloured 
balls, the probability that the combined result of two drawings, 
one from each um, will give the two white balls is ^ x J, since the 
drawing from the first um in no way affects the drawing from the 
second. If, however, we were required to find the chance that in 
two successive drawings from an urn containing two white and 
five coloured balls the two white balls would be drawn (a ball 
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drawn not being replaced) we should reason thus. The chance that 
a white ball is drawn at the first drawing is f ; after this has happened 
there will be left one white and five coloured balls, and the chance 
of drawing the other white ball at the second drawing will be 
The combined chance is therefore f x 

15 . It has been shown in paragraph 12 that the multiplication rule 
can be applied to the probability of happening of any number of 
independent events, so that if p, p', p" , ... be the chances of the 
events happening severally, the chance that they all happen is 
pp'p".... 

Again, if the chance that an event happens in a single trial be p, 
the chance that it will happen in each of r trials is p\ and the chance 
that it will fail in each of, say, (n — r) trials is ( i 

Suppose that it is desired to find the chance that an event will 
happen exactly r times in n trials. 

Let the chance that in any one trial the event does not happen 
be q{ = i ~p)- Then if the event happen in r trials and fail in 
n — r, the combined chance is But of the n trials we can 

select the r trials in which the event happens in ways. The 
chance that the event happens in exactly r trials out of the n is the 
chance that it happens in these r trials and fails in the remaining 
n — r trials. The required chance is therefore n(,.)/>’’g"~’'. 

Now it will be noted that is the term containing p' in 

the expansion of (p + 9)”. The successive terms of this expansion 
will therefore give the probabilities of the event happening 
n, n— I, n — 2, ... n — r, ... times exactly in n trials, 

16 . I’he following examples are illustrative of the principles out- 
lined above. 

Example 13. 

Find the chance of drawing a king, a queen and a knave in that order 
from an ordinary pack in three consecutive draws, the cards drawn not 
being replaced. 

This is an example of the application of the multiplication rule to 
dependent probabilities. 
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If Pi be the chance of drawing a king; 

/)j „ „ ,, a queen, a king having been drawn ; 

p, ,, ,, ,, a knave, a king and a queen having 

been drawn; 

then the chance of drawing a king, a queen and a knave in succession 

IS P1P2P3 ■ 

Now there are 4 kings m the pack, so that the chance of drawing a 
king is 5^. When the king has been drawn there are 5 1 cards left of 
which 4 are queens. The chance of drawing a queen is therefore 
Similarly, the chance of drawing a knave from the remaining 50 cards is 

Pi~ *2 > p2~^i> P3 — 5^0 • 

4 ** 

The required chance = 

[Compare Example 2, where the three cards are drawn at random (not 
in a specified order) and the problem is treated as one in simple 
probability.] 

Example 14. 

A bag contains three balls, one red, one white and one blue X and Y 
draw a ball at random alternately. If X draws the red ball or 1 ’ the white 
ball It is retained. Otherwise the ball drawn is immediately replaced. 
Find the chance that just before the fifth draw is made the blue ball 
only IS in the bag. 

Let R, W, B denote the red, white and blue ball respectively. Now 
the blue ball is the only one in the bag just before the fifth draw if any 
of the followmg have happened ; 


(') 

A 

first draw 

R 

Y 

first draw 

B 

X 

second draw 

B 

Y 

second draw 

W 

(ii) 

R 

B 

W 

W 

(iii) 

B 

W 

R 

B 

(iv) 

W 

W 

R 

B 

(V) 

R 

W 

B 

B 

(vi) 

B 

R 

R 

W 

(vii) 

B 

B 

R 

W 

(viii) 

W 

R 

R 

w 

(ix) 

W 

B 

R 

w 
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= The coefficient of in (i — 4*® + . . . ) + 4Jc + 10*® + . . . 

Q.io.ii „ 15.16.17 \ 

+^— ^ — *»+ ... •••) 

15. 16. 17 4.9. 10. II . 0 

= -5— — -+6.10 = 80. 

6 6 

The required chance is therefore 

Note. We are here dealing with arrangements which would be dis- 
tinguishable if one die were red, one blue, one yellow and one green. 
Selections in an example of this type would give quite wrong results. 

Example 17 . 

Nine cards are drawn at random from a set of cards. Each card is 
marked with one of the numbers i, o or — i, and it is equally likely that 
any of the three numbers will be drawn. Find the chance that the sum 
of the numbers on the cards thus drawn is zero. 

The number of favourable drawings will be the coefficient of ifi in 
the expansion of (ari + x® + *’■)* ; the total number of possible drawings 
will be the sum of all the coefficients in the same expansion. 

The coefficient of a® in +a® + a:i)* 

= „ a* in x~‘ ( I + * + a®)* 

= ,, a* in (i +* + a®)'’ 

= „ a* in (i -a®)* ( I -a)-®. 

Proceeding on the same lines as in Example 16, the required prob- 
ability is found to be . 

18 . The theory of probability was evolved from a consideration of 
games of chance, and many problems dealing with these games 
can be solved by the elementary methods outlined above. No new 
principles are involved; all that is required in attacking these 
problems is a clear understanding of the particular happenings 
that may arise. The following examples are illustrative of the 
methods: the first of these examples is analysed fully, and the 
student should be able to solve problems of the same type by the 
application of similar reasoning. 
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Example 18. 

A and B throw alternately with a pair of ordinary dice. A wins if he 
throws six before B throws seven, and B if he throws seven before 
A throws six. If A begins, show that his chance of winning is If- 
(Huyghens’s Problem.) 

First, let us consider the chances of throwing six or seven with two dice. 
We can find the number of ways of throwing these numbers either by 
actually counting the number of ways (as in Example 5) or by finding the 
coefficients of x® and x’ in the expansion of (x + + x® + x® + 

(as in Example 16). Since the figures are small, the first way is simpler, 
and it is easily seen that six can be thrown in 5 ways, and seven in 6 ways. 
The chance of throwing six in one throw with the two dice is therefore 
jg , and of throwing seven . 

A can win if he throws six the first time. His chance of throwing this 
number is 

He may fail to throw six the first time. He can then win if B fails to 
throw seven at his first throw and A throws six with his second throw. 

The cliance that A fails to throw six is i = 

The chance that B throws seven is and the chance that he fails 
to throw this number is i—xe~ M • 

Therefore the chance that A wins at the second throw is -jf 

If A fails to win at the second throw, he can win at the third throw if 
B has not thrown seven at his second throw. The chance of this is 


31 30 31 30 5 „n,J cr. 

IT Ta W T# -Ja and so on. 

A's chance of winning in the long run, i.e. after a very large number 
of trials, is therefore 


3_0 

■r5ff-3fl-36''‘36-3ff-36-3S-3B 




where r = fi.||. 

If we sum this geometrical progression to infinity, we find that A’s 
chance 

= 5 _L_= i 

36 I —r 36’ 366 61 ' 


Example 19. 

A and B play a match to be decided as soon as either has won two 
games. The chance of either winning a game is ^ and of its being 
drawn What is the chance that the match is finished in 10 or less 
games? 


15-3 
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If the match is not finished in lo games, the following must occur : 

(i) All the games must be drawn ; or 

(ii) A or B must win one game and the remaining 9 must be 

drawn; or 

(iii) A and B must each win one game and the remaining 8 must 

be drawn. 

The chances of these mutually exclusive events are: 



since this result may occur in 10 different ways and either A or B may 
win; 

since the number of orders in which this may occur is . 

The chance that the match is not finished in 10 games is, by the 
addition rule, 


/^yo /9y I ^ 
\10j \10/ 20 



10.9 


_ 9^-387 
2. 10^® ’ 


i.e. the chance that the match is finished in 10 games or less 
9 ® -387 


= I — : 


2. io‘ 


- = •17 approximately. 


Example 20, 

A, B, C, D each throw two dice for a prize. The highest throw wins, 
but if equal highest throws are made by two or more players, those 
players continue. A throws 9, B throws 7. Find C’s chance of winning 
the prize. 

Consider the following scheme, showing the total number of possible 
ways in which C may win: 
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C may win if 
C throws 12, D less than 12 


Probability 

L ^ 

6* -36 

III* 


C throj^ia, D 12, .and C wins later 
V. > ' JitkU t J ^ 1 

1 * >i f ^ ^ t' 

^ ^ 2 33 

C throws II, D le^ than ii 


I ■< i i 

C throws II, D II, and C wins later 
C throws 10, D less th|Ln 10 j 

C throws 10, D 10, and C wins later 


62' 36 
2 2 1 * 

3_ 3'3 
6^ 36 

i. 1 

6 '^ ' 6^ ■ 2 


C throws 9, D less than 9 and C wins later 
(since ^ has thrown 9) 

C throws 9, O 9, and C wins later 
The total of these chances is ^9"^. 


4 26 1* 
6^ ’36 '2 

111* 
6^ 6^ 3 


19. Examples based on mortality tables. 

A mortality table is a table of the successive values of for 
integral values of x, where is the number of persons out of /q bom 
w'ho attain precise age x. We are not here concerned with the con- 
struction of a mortality table from the original data. For the present 
purpose we may, however, note that 

(i) the table is constructed from data sufficient for us to use 

the N of the statistical definition of probability; 

(ii) the successive values of 4 are not absolute, but relative, 

referring to a common “radix” chosen for convenience 
of tabulation. 

In explanation of this second point it may be stated that the 
radix may be any number and may be taken anywhere in the table. 

• The final factor in each of these terms expresses the chance that C will 
win after having equalled the number thrown by D and/or A. 
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For example, the radix may be 1000 at age 50, so that /5o= 1000; it 
may be 10,000 at age 20 or even 1,000,000 at age o. The purpose 
for which the table is constructed determines the position and 
magnitude of the radix. 

In the following table the radix is 1,000,000 at age o; for the 
purposes of illustration only that portion of the table from age 90 
to the end of the table has been taken. 


English Life Table No. 10 (1930-32). Males 


Age X 

Number living 
at age X 

4 

Number dying between 
ages A.- and *+ i 
l-I 

90 

16090 

4600 

91 

1 1490 

3478 

92 

8012 

2564 

93 

5448 

1841 

94 

3607 

1287 

95 

2320 

873 

96 

1447 

574 

97 

873 

364 

98 

509 

223 

99 

286 

132 

100 

154 

74 

lOI 

80 

41 

102 

39 

21 

103 

18 

10 

104 

8 

8 


In the 4 column we have the number 16090 at age 90, while in 
the same column we have the number 873 at age 97. This does not 
mean that precisely 16090 persons aged 90 wxre observed and that 
seven years later there survived exactly 873 of these persons. The 
numbers of persons actually observed at ages 90 and 97 were quite 
different. We may, however, use the two numbers given at these 
ages to obtain the probability that a person aged 90 shall survive 
to age 97, thus; 

\{p is the required probability, then /)iV= 873, where N= 16090. 
p is therefore /)A/^/N= 873/16090 = -05426. 
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It should be noted that each value in the mortality table is the 
radix at its own age for all subsequent values in the table. For 
example, if 197/^90 gives the probability at age 90 of living 7 years at 
least, so also does /95//91 give the probability at age 91 of living 
4 years at least. Generally 4^_„/4 gives the probability at age x of 
living n years at least. 

The following example is based on the mortality table given 
above. 


Example 21. 

The ages of three men. A, B and C, are 90, 91 and 92 respectively, 
h'ind the following probabilities; 


(i) that A, B and C will be alive in two years’ time ; 

(ii) that one at least of the three will be alive in two years’ time; 

(iii) that exactly one of the three will be alive in two years’ time; 

(iv) that exactly two of the three will be alive in two years’ time ; 

(v) that all will be dead in two years’ time. 

The required probabilities are 


(\\ h ^94 _ 8012 5448 3^07^ 

^ ‘90 ^92 16090 II49O 8012 

= I — (i - 8012/16090) (i — 5448/ 1 1490) (1 —3607/8012)= -855, 

= ’394 on substituting and simplifying. 


*90 *91 \ ‘oa' *90 \ *91/ *92 V 


^92\ ^93 Aw 


‘90 

= •355 


^/^91 42 




= • 145 - 
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Note. If we add the results under (i), (iii), (iv) and (v) we obtain 
unity. These exhaust all the possibilities. The result under (ii), however, 
is not included, and it is instructive for the student to discover why 
this is so. 

The following are further examples involving the probabilities 
of living and dying, not based on the figures in the table on p. 230. 
The same principles are involved in the solutions of these questions. 


Example 22. 

Three men were known to be alive five years ago when their ages 
were 31, 48, 69. Assuming that of 98 males bom together one dies 
annually until there are no survivors, find the chances that 

(1) all are alive now; 

(2) none are alive now; 

(3) one, and only one, is alive now; 

(4) two are alive and one dead. 

The chance that a man aged x dies in five years is 
The required probabilities are 
(0 

(2) ?\-F(T-2V: 

(3) 5 f ■ • A + similar expressions ; 

(4) -M-zr + two similar expressions. 2- 

As the above are the only possible events that can happen, the total 
of the chances is unity and the reader should verify this. 


clearly — . 
98-x 


Example 23 . 

Three men, P, Q and R, are each of exact age 96. Find the chance 
that they will all die at different ages last birthday in the order P, O, R 
given that 

Chance of dying before 
Exact age next exact age 

96 i 

97 f 

98 J 

99 I 

We have to find the chance of their dying in a given order. There will 
be no need therefore to take into consideration the different orders in 
which the men may die. 
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(i) P may die at age 96 Chance = 

Q may die at age 97, i.e. may survive 

a year and die in the following year ,, =(r - 

R may die at age 98 or later; i.e. may , ■ ^ 

survive 2 years and die in the 

following year „ =(i - J) (i -g) i. 

The total chance under this heading is therefore 

i x(i-.l) 5 x(i-i)(i-g) i = /y. 

(ii) P may die at age 96 Chance = 

Q .. 98 » 

R .. 99 .. I- 

Total chance under heading (ii) 

= ix(i-^) (i-l) |x(i-i)(i-§)(i-J) 1 = ^1-^. 

(iii) P may die at age 97 Chance = (1 -|) f. 

Q » 98 » 

R .. 99 .. =(i-i)(i-5)(i-3) I. 

Total chance under heading (iii) 

Required chance = /g + riT +bh = iHi ■ 

20. A simple problem in probability may sometimes be capable 
of more than one reading. When giving the solution to a question 
which may be construed in more than one way it is essential that 
the assumptions on which the solution is based should be stated 
at the outset. 

'rhe two examples given below illustrate this point. 

Example 24 . 

What IS the chance that a hand of five cards contains a pair of two 
like cards of different suits? 

If the problem is to find the chance that of five cards two are to he like 
cards and the other three unlike, a result is obtained which is quite 
different from the chance that there are to be two like cards, it being 
immaterial what the other cards are. Moreover, this second reading is 
capable of two alternatives : there may be two like cards and two other 
like cards (e.g. two kings and two fours) in the five cards ; or there may 
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be three or more like cards — this is not expressly excluded by the 
question. 

It will be instructive before setting down all the chances that are 
possible to examine what would seem to be the obvious solution. 

Let N denote the number of ways of selecting five cards from 52. 


Then 




The probability of drawing exactly two like cards of given denomina- 
tion (aces, say) will be 


4 i 3 ^ 4^47-46 


2 ! 


3 ^ 


or 


2162 


and since there are 13 ditferent car<Is' in eacfi suit it might be thought 
that to multiply the above fraction by 13 would give the result. This 
answer is, however, incorrect, in that there is “overlapping”, for the 
remaining three cards out of the five may be a pair of like cards which 
have already been counted. 

The possible arrangements, with their respective chances, are 

Chance 

48XJ13 
.V 


Arrangement 

(a) 4 like cards and i different 


or 


(6) 3 like cards and 2 like cards 

, c/ / “ ’’ ' 

■’ ivC ,'V 

(f) 3 like cards and 2 different 

■ ' • / i- ’ ' 

I • 5 . V [ , 

1 


4 - 3-2 4-3 

13 X—— X 12 X—;' 

3 1 2 ! 


N u 


‘13 X 


4 - 3-2 


X 4' X 


12. II 


A 


or 


or 


I 


13.12 4.3 4.3 
If - 4 X ^ X - ^ X 44* 0 

' ■ /' 11 ' ■' j "" j'-Tr '« O2! 2! 2! ^ 198 

(d) z sets of 2 like cards, 1 different « ■ — or 




A 


(e) 2 like cards and 3 different 


■*’ 4-3 

X 2 — X 4' X 

2 ! 


'I "I 

12. II . 10 


6 

4165’ 

88 

4165’ 

198 

41^’ 


3 '■ 


f > » r, 


A 


or 


1760 
4165’ 

Tlie reasoning to obtain these chances is straightforward, and it will 
be necessary to examine one only: the others are on the same lines. 
Consider the arrangement in which there are to be exactly two like 


* 44 'because 8 card* are unallowable, being 4 of each of the previous numbers 
chosen. This avoids duplication with ( 6 ). 
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cards and consequently three different ones. Having settled on the 
denomination of the like cards — which can be done in 13 ways — there 
remain 12 denominations from which to choose 3. This can be done 

ways. The two like cards can be selected from the four 

4*1 

(one of each suit) in ways. This does not quite complete the selec- 
tion, for the three different cards may be selected from any of the four 
suits, and this can be done in 4 x 4 x 4 ways. 

Returning now to the analysis above. If the problem is confined to 
the chance where exactly two like cards are to be chosen, the result will 
be (e) ; if there are not to be more than two like cards, two sets of two 
like cards being permitted, the chance will be (tf) + (e); if there are 
no restrictions, and we are given that two like cards at least are to be 
among the five, we shall require the total of the chances {a) to (e). 

These results are respectively iflf , and fx|f . 

The above method of solution is designed to show, in full, the several 
probabilities of the various arrangements that may occur. If, however, 
it is known that there are no restrictions other than that there shall be 
at least two like cards, a simpler method of solving this problem obviously 
presents itself. It is sufficient to calculate the probability that there shall 
not be any like cards among the five; this probability is evidently the 
complement of that required. The probability that there are no like 
cards is 


13 . 12. II . 10. q 


TV 


2112 

41^' 


the complement of which agrees with the third answer given above. 


Example 25. 

A network of wires forms squares like a chess-board. Two spiders, 
starting at the same time from opposite ends of a diagonal, crawl to the 
end opposite them at the same speed and by one of the shortest routes. 
What is the chance that they meet? 

Let the square AQBR be the boundary of the network. Since the 
spiders start from A and B at tlie same time and crawl at the same pace, 
after having crawled for the same length of time they must arrive at 
points on the diagonal QR. They will meet if the points on QR are 
identical, but not otherwise. Also, any point on the diagonal relevant 
to the problem must be an angular point of one of the squares of 
which AQBR is the boundary. The number of squares which have 
diagonals lying along QR is n. 
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Let P be such a point with coordinates {x, y) reckoned from A as 
origin and AR and AQ as axes. Then x +y = n, and x and y are integers. 



The number of routes from .<4 to 5 is for any route consists of n 
steps in the direction A to R and n steps in the direction A to Q. These 
steps may be arranged in any of (an) !/(n !) (n !) ways. As all these routes 
are equally likely, the chance that any one route is taken is i/^"C„. 

But the number of ways of arriving at P will vary according to the 
position of P. The number of ways of arriving at P from A isn [j(x !) (y !) 
or "Cjf (since * +y = n), and this is also the number of ways from P to B. 
Therefore, of the ^C„ routes from A to B, {"C^)- pass through P, and 
the probability that a route passes through P is thus . 

("C ("C 

The chance that the spiders meet at P is therefore 


x—n 

V 


2 ("CJ* 

the chance that they meet at all is • 

v c,y 

Now this result is based on the supposition that all routes from A to B 
are equally likely. It may, however, be argued that once the diagonal 
QR is reached, the event has or has not happened, so that it is un- 
necessary to consider routes beyond the diagonal. On this assumption 
all routes to the diagonal are supposed equally likely. 

Thus, to reach P from A, the hrst spider may take one of "Cj. different 
routes. To reach the diagonal, the number of possible routes, each of 


x=^n 

which is, under the present assumption, equally likely, is 2 "C^.. 

j- -0 

The probability that each spider reaches P is therefore "CJ2’'Cj., and 
that they both do so is the square of this expression. 

Therefore the probability that they meet at all is 


X’^n 

g—0 

2^" ’ 

S"C^="Co-H"Ci+"Cj-H ... -|-"C„ = 2". 


since 
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It should be noted that, while the numerator of the fraction denoting 

the probability under the first assumption — namely "L — cannot 

be evaluated in the general case, the numerator of the corresponding 
fraction under the second assumption can easily be shown to be “"C„. 

The two results are not the same. While a general proof is difficult, it 
is a straightforward matter to show that in a special case the numerical 
values are different. Thus, if n = 8, the probability in the first case is 
about -208 and that in the second about -196. 

21. The method of induction. 

There is a certain type of problem in probability for which it 
is not possible to obtain a solution by the direct methods outlined 
in the preceding paragraphs. In these problems it is necessary to 
find a relation connecting the chance at any stage with that at suc- 
ceeding stages and then to calculate the required probability by 
adopting an inductive process. The difficulty in these questions is to 
ascertain the fundamental relation; when this has been established 
the problem can be solved by the application of algebraic methods. 

Since we have to obtain a relation connecting the probability of 
an event of the nth stage with those at succeeding stages, it is often 
of advantage to investigate the relation for simple numerical values 
of n and then to deduce the general result. The following is an 
excellent example of this method of solution. 

Example 26 . 

Five green balls and sixteen red balls are placed in a bag. A ball is 
drawn at random n times in succession and replaced after each drawing. 
Find the chance that no two successive drawings shall have given green 
balls. 

To satisfy the required conditions the events must have taken place 
as follows : 

At the first draw either a red or a green ball must be drawn : 

R 

G 

At the second draw one of the following draws must have taken place : 

R R R G 

G R 

i.e. first drawing associated with R-,otRG. 
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At the third draw; 
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R R 
G R 
R G 


R 

R 

R 


R R G 
G R G 


i.e. second drawing associated with R\ or first drawing associated with 
R G. 


kt the fourth draw : 


R 

R 

R 

R 

R 

R\ 

R 

G 

G 

R 

R 

R 

G 

R 

R 

G 

R 

G 

R 

R 

R 

g\ 

R 

G 

R 

R 

G 

R 





G 

R 

G 

R 






i.e. third drawing associated with R\ or second drawing associated 
with R G. 


It is evident therefore that if we have attained success at the {n— i)th 
stage, i.e. if no two successive green balls have been drawn, we shall 
attain success at the nth stage provided that, either 

(i) the nth draw gives a red ball ; 

or (ii) if the nth draw gives a green ball the drawing at the (n- i)th 
stage gave a red ball. 

The chance of drawing a red ball = J ®, and the chance of drawing 
a green ball=^. 

If therefore u„ be the required chance after n drawings, we shall have 

ft — i.§. 1/ 4. JL® 11 

“n •“ 2 r ^n-l ^ 2T • 2 1 “n~2 • 

This is a relation connecting three successive coefficients of the series 

Uo + UiJC + Ug*^+ ... + + .... 

The series is therefore a recurring series with scale of relation 

I _ 1 c „ !L ii V.2 

'■ ai* 21 ■ 21* ■ 

Also «o= I and 12^=1, since the conditions are satisfied if no ball or 
one ball is drawn. 

Proceeding in the usual way, we find that the generating function is 


+ X (ui— i6uq/zi) j+^xjzi 

(i +4JC/21) (i —zoxjzi) (i +4a:/2i) (i —zoxjzi) 


25 I II 

24’!— 2oa:/2i 24’i+4.v/2i 

24 V 21 / 24 V 21/ 
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The required chance, is the coefficient of in these two expan- 
sions. This is easily seen to be 

5 V[ 2 S (M)“-(-i)”(A)"]- 

Note. As an alternative method of obtaining the coefficients of (|®)" 
and ( — 2*1-)" we may say that u„=a (-^)". a and b can then 

be found from the conditions i/,, = i and iq = i . 

The following examples are illustrative of the same method of 
attack. 

Example 27 . 

A player tosses a coin and is to score one point for every head turned 
up and two for every tail. He is to play on until his score reaches or 
passes n. If is his chance of attaining exactly «, show that 

Pn = h{Pn-X+Pn-i) 
and hence find the value of p„ . 

There are two ways of reaching n exactly, namely, by throwing 

(i) a tail when the score is « - 2 ; or 

(ii) a head when the score is n — 1. 

The respective chances are \pn-i and ^pn-\- Since these are mutually 
exclusive, we have 

Pn^hiPn-l+Pn-i)- 


The value of p„ may be found in either of two ways : 


(«) 

Pr> = i{Pn-l+Pn-i), 


or 

Pn + kPn-l = Pn-l + kPn-f • 

(0 

Also 

= . , 


and 

1 obviously. 

(2) 


By repetition of (i) and use of the facts in {2) we find that 


which simplifies easily to 



(6) We may treat i — ^x — as the scale of relation of a recurring 
series and proceed as in Example 26. 
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If the series is 

pg+PiX+p^^+p^+ ... +/«*"+ •••. 

A = i./>*=i and />o=i- 

The generating function of the series is 

I _ I 

I - “ (i ~ (l + ^*)’ 

By partial fractions this becomes 

2 I 

3(i-Jc) 3 

Expanding by the binomial theorem, we obtain for the value of p„ 
(the coefficient of x”) the same result as is found by the first method. 

Example 28. 

A has 10 counters and B has 5 ; their chances of winning a single 
game are in the ratio 2:1. The loser in each game is to give a counter 
to his opponent. The game stops when one or the other has lost all his 
counters. Find A’s chance of winning all B’s counters. 

Let u„ be .4’s chance of winning all B’s counters when A has n 
counters. In the next game A must either win or lose a counter. His 
chances of these contingencies are § and J respectively. When he has 
lost the next game his chance of winning all B’s counters is m„_j and 
when he has won the next game it is . 

Hence «n = l«n+l + • 

It is required to find Uj, from the above relation, given that «jj = i 
(since A will then have won) and «o = o (since he will have lost). 

The required relation for the recurring series is therefore 

f-x+J**. 

Since we may write this relation as 

§ (i-fx + ix2) = §(i-ix)(i-x), 

we may put w„ — a HY' + b, and then obtain a and b from the values of 

and U15. 

This is a particular example of the problem of “duration of play’’. 
The problem in its general form gives m counters to A and n counters 
to B, and states that A’s chances of winning, drawing and losing a single 
game are />, q, r respectively, where p + q + r=i. The method of solution 
is precisely similar to that above. 
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The inductive method can be adapted to other types of question in 
probability. An excellent example will be found in J.I.A. vol. lvi, 
pp. 102-104, where a problem in direct probability is solved very simply 
by the inductive process. 

22 . We will conclude this chapter with some miscellaneous ex- 
amples. These problems require only a careful application of the 
ordinary methods, and no special comment is necessary. 

Example 20 . 

The sum of two positive quantities is constant and equal to zn. Find 
the chance that the product of the two quantities is not less than | their 
greatest product. 

The product of two positive quantities whose sum is constant is 
greatest when the quantities are equal. The greatest product is therefore 

zn zn 
— . — —n‘. 
z z 

If the two quantities are x and zn — xv/e must have 

/ \ 3 ”'* 

X {zn — x)>-- , 

4 

i.e. 4x'^-8nx + 2n^^o, 

i.e. (zx — 3n) {zx — n) ^ o. 

Therefore x must lie between — ” and ”. 

2 2 

The possible values of x range from o to zn, and the chance required 
is therefore 


Example 30. 

T en clubs compete annually for a cup which is to become the absolute 
property of the club which wins it for three years in succession, .\ssuming 
that all the clubs are of equal skill, find the chance that last year’s 
winners, not having won the previous year, will ultimately win the cup. 

X 

Let ^ be ^’s chance of winning outright, A having won the previous 
year. Each of the others has a chance equal to 

X 

I 

10 



since the clubs are of equal skill. 


JL 

2« i 2 2! ^ 


FMA S 


16 
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Now (a) A’s chance of winning next year and the year after 

=(^) • 

(b) A’s chance of winning, losing and then having the same 

chance as the other eight losing clubs 

X 

^ I _9 ^~io 
10’ 10' Q ‘ 

(c) A’s chance of losing and then having the same chance as 

the other eight losing clubs 



A's total chance = (a) + (b) + (c), and this we know to be 

10 

the resulting equation, we find that ^’s chance is 


Solving 


Note. The above solution assumes that the total probability of win- 
ning, for all the clubs, is unity, i.e. that the cup must eventually be won. 
It is easily seen that the cup must be won outright. If the chance that 
any particular club fails to win the cup outright after m trials is i/n, 
where n is greater than i, then if there be an infinitely large number of 
sets of trials, the chance that the cup is never won outright by any 

/,\km 


particular club will not be greater than Lt ( - which is obviously 

*->-00 \w/ 


zero. In other words, if the contests be continued for a sufficient length 
of time, the chance that the cup is not won outright is zero, i.e. the cup 
must be won. 


Example 31. 

The atoms of a certain radio-active element are continuously dis- 
integrating at the uniform rate of per annum, those atoms that dis- 
integrate thereupon becoming atoms of a different element. What is the 
average period of years that an atom of the original element will survive 
as such, and what is the probability that any particular atom will survive 
that period? 

Let the year be divided into n instants. 

Then the chance that an atom disintegrates in any one instant is 

and „ „ „ does not disintegrate „ „ i — . 
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Now let / (r) be the chance that the atom has not disintegrated at the 
end of the rth instant. 


Then 




and the average number of instants required for disintegration is 
1+ S f{r)=i+ S (i ) 

r-l r-i V 3oon/ 


I- I — 




3oo«/ 

= 3oon, i.e. 300 years. 

The chance that an atom survives this period is 


Lt 

n-^ 00 


, \a00n 

I =e-i. 

30on/ 


EXAMPLES 10 

1. Explain how the probability of a compound event, consisting of 
two constituent simple events, is obtained. Illustrate your answer by 
examples. 

2. The chance of one event happening is the square of the chance 
of a second event, but the odds against the first are the cube of the odds 
against the second. Find the chance of each. 

3. If three squares are chosen at random on a chess-board, show that 
the chance that they should be in a diagonal line is . 

4. A man has three current English coins. Find the chance that he 
can give change for half a crown. 

5. A can hit a target four times in 5 shots; B three times in 4 shots; 
C twice in 3 shots. They fire a volley ; what is the probability that two 
shots at least hit? 

6. A and B stand in a ring with ten other persons. If the arrangement 
of the twelve persoiw is at random, find the chance that there are exactly 
three persons between A and B. 

7. The first twelve letters of the alphabet are written down at random. 
What is the probability that there are four letters between the A and 
theB? 
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8. Find the chance of drawing two white balls in the first two draws 
from a bag containing five red and seven white balls, balls drawn not 
being replaced. 

9. An experiment succeeds twice as often as it fails. Find the chance 
that in the next six trials there will be at least four successes. 

10. If an experiment is equally likely to succeed or fail, find the 
chance that it will succeed exactly n times in zn trials. 

11. Find the chance of throwing ten with four dice. 

12. If a die whose faces are numbered from i to 6 is thrown four 
times, what is the probability that the sum of the four throws is 14? 

13. A five-figure number consisting of the digits 0, i, 2, 3, 4 (no 

repetitions) is chosen at random. What is the chance that it is divisible 
by 4? s 

14. Out of a bag containing thirteen balls, six are drawn and replaced, 
and then seven are drawn. Find the chance that at least three balls were 
common to the two drawings. 

15. If a die is thrown five times what is the probability that a six 
appears on at least two consecutive occasions.^ 

16. What is the chance that a person with two dice will throw aces 
exactly four times in six trials? 

17. There arc m candidates taking an examination paper of n ques- 
tions of equal difficulty; assuming that a candidate answers a question 
correctly or not at all, either being equally likely : 

(a) In how many different ways may a paper be answered? 

(b) How many different sets of answered papers are possible? 

^ (c) What is the chance that a set of papers is handed in in which 
a particular question is solved by not more than one candi- 
date? 

18. Six cards are chosen at random from a pack of 52. Find the 
chance that three will be black and three red. 

19. A card is chosen at random from each of six packs of cards. Find 
the chance that three will be black and three red. 

20. The 26 letters of the alphabet are placed in a bag. A and B 
alternately draw a letter from the bag, the letters drawn not being 
replaced. The winner is the one who draivs most vowels. A starts and 
draws a vowel with his first draw. What is his chance of winning? 

21. A book contains 1000 pages. A page is chosen at random. What 
is the chance that ^e sum of the digits of the number on the page is nine ? 
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22. A bag contains three tickets marked with the numbers 00, 01, 
10, and two tickets each marked with the number 1 1. A ticket is drawn 
at random eight times, being replaced each time. Find the probability 
that the sum of the numbers on the tickets thus drawn is 33. 

23. If X be one of the first hundred numbers chosen at random, find 

the probability that x + is greater than 50. 

24. In a lottery there are 1000 tickets numbered i to 1000. Three 
tickets are drawn. Find the chance that 

(1) the three tickets bear consecutive numbers; 

(2) two of the three bear consecutive numbers. 

25. If m odd integers and n even integers be written down at random, 
prove that the chance that no two odd numbers are adjacent to one 
another is 

n ! (n+ i) ! 

(m + «) ! (« — »J + 1) !’ 

m being not greater than n+ 1. 

26. The sum of two whole numbers is 100; find the chance that their 
product is greater than 1000. 

27. There are ten tickets, five of which are numbered i, 2, 3, 4, 5 
and the other five are blank. What is the probability of drawing a total 
of 10 in three trials, one ticket being drawn and replaced at each trial? 

28. If two of the first hundred numbers are chosen at random, what 
is the probability that their difference is greater than 10? 

29. A and B have equal chances of winning a single game; A wants 
two games and B wants three games to win a match. Find the chance 
that A will win the match. 

30. A and B play at a game which cannot be drawn. On the average 
A wins three games out of five. Show that it is more than 2 to i that A 
would win at least three games Out of the first five. 

31. A, B, C throw in order, each using three dice. Prove that A’s 
chance of throwing 10 first is and find C’s chance. 

32. A and B play for a prize. A is to throw a die first and is to win 
if he throws 6. If he fails, B is to throw and win if he throws 6 or 5. 
If he fails, A is to throw again and win if he throws 6 or 5 or 4, and so 
on. Find each player’s chance of winning. 

33. A and B play for a stake which is to be won by him who makes 
the highest score in four throwrs of a die. After two throws A has scored 
12 and B 9. What is A’s chance of "inning? 
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34. A and B play a set of games, to be won by the player who first 
wins four games, with the condition that if they each win three they are 
to play the best of three to decide the set. A’s chance of winning a 
single game is to B's as 2 to i . Find their respective chances of winning 
the set. 

35. A, B and C draw in succession from a bag containing four white 
and eight black balls until a white ball is drawn. What is the probability 
that the white ball is drawn by C? Is his chance improved if each ball is 
replaced after drawing? 

36. Three players of equal skill, A, B and C, play a series of games 
and the winner of each game scores one point. Each of the three keeps 
a separate score and the winner of the set is the one who first scores 4. 
A wins the first, B the second and A the third game. What is then C’s 
chance of winning the set? 

37. A and B play a match of five games. j 4 ’s chances of winning, 
drawing and losing any game are in proportion to 3, 2 and i respectively. 
Two points are scored for a win and one for a draw. What is the chance 
that the match is drawn ? 

38. A and B play a match, the winner being the one who first wins 
two games in succession, no games being drawn. Their respective chances 
of winning any particular game are p : q. Find 

(1) .< 4 ’s initial chance of winning; 

(2) A’s chance of winning after having won the first game. 

39. Three players. A, B, C, play under the following conditions. In 
each turn the chance of success is the same for each of two contestants. 
A and B play together for the first turn, the winner plays with C, and 
if he win again he wins the game; if not, C plays with the third man 
and 80 on until one man has won two turns in succession. Find each 
man’s chance of winning the game. 

40. The winner of a game is the one who first scores 4 points, but 
if both players score 3 points the game continues until one player has 
scored 2 points more than the other, A and B play; find A’s chance 
of winning when the score is 2 — o in B’s favour, being given that A 
is twice as skilful a player as B. 

41. A, B, C, D each throw two dice for a prize. The highest throw 
wins, but if equal highest throws occur (thrown bj two or more players) 
the players with these throws continue. A throws 10; find his chance 
of winning. 

42. A and B cut a pack of cards, the player who wins the cut six 
times to be the winner. A, having won four times to B’s once, cuts a 
five. Find the cb^ce that A will be a winner. 
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43. A and B play a match consisting of a maximum of nine games. 
The chance that any game is won by A, won by B or drawn are equal. 
A win counts one point and a draw half a point (to each). The match 
ends when one of the players has a sufficient lead to leave him with an 
excess of points over his opponent even if the latter were to win all the 
remaining games. What is the chance that the match ends with the 
7th game and not before? 

^ 44. A bag contains ten counters, numbered 1 to 10. One counter is 

drawn and replaced and this operation is repeated imtil four different 
numbers have appeared. Calculate the probability that success will be 
attained with the sixth draw. 

45. If n is the product of any 69 integers taken at random, find, to 
the first significant place of decimals, the value of the probability that 
n is not a multiple of 5, given that logj^ 2 = -3010300. 

46. A bag contains counters marked with the digits 2, 4, 6, 8 and 
the number of times each digit occurs is equal to the value of the digit. 
Counters are drawn one at a time, each counter being replaced when 
drawn. What is the chance 

(1) that the digit 2 is drawn before the digit 8; 

(2) that the sum of the first three digits is 16; 

(3) that the first five counters drawn contain at least one of those 

marked 4 or 6? 

47. A and B have each eight pennies. Each tosses his set of pennies. 
Find to three places of decimals the chance that the number of heads 
obtained by A exceeds the number obtained by B by at least three. 

48. At a certain age 99 per cent, of the persons alive at the beginning 
of the year will live to the end of the year. Find expressions for the 
probabilities that out of four persons of that age there will die within 
the year 

(1) exactly 2; 

(2) not more than 2; 

(3) two specified persons; 

(4) two specified persons and no others. 

49. If the probability that exactly three lives out of six all aged x 
survive « years is -08192, find the probability that at least three survive 
n years. 

50. The probability that exactly one life out of three fives aged 20, 
35 and 50, will survive 15 years is 092; the probability that all will die 
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within 15 years is -006. If the probability that the life aged 20 will 
survive 15 years is -9, find the probability that 

(1) he will survive 30 years; 

(2) he will survive 45 years. 

51. Three men, .( 4 , C, are each aged 30. Given that the probability 
that a man aged 30 will survive 5 years is -974 and the probability that 
he will survive 10 years is -940, find the chance that between the end of 
the 5th year and the end of the loth year from now 

(1) one at least will die; 

(2) all will die, A dying first and B second. 

Find also the chance that A and B will die within this period and C will 
survive the loth year. 

52. The following table shows the probability that a woman of the 
age specified will marry in a year; 


Age 

Probability of marriage 

20 

•0665 

25 

•1033 

30 

•0649 

40 

•0183 


Find the probability that, out of 4 women aged 20, 25, 30, 40 re- 
spectively, only one marries within a year. 

53. Given that the probability that of three lives aged * one, and one 
only, will survive n years is 27 times the probability that all will die 
within n years, find the probability that 

(а) at least two will survive n years ; 

(б) at least one will die within n years. 

54. Given the following table, find the probability that of four persons 
aged 65 at least one will die between ages 75 and 85 and at least one 
after age 85; 

Age Probability of surviving 10 years 

65 One-half 

75 One-fifth 

55. If m things are distributed amongst a men and b women, show 
that the chance that the number of things received by the group of 
men is odd is equal to 

I (ft + a)"* — (A — a)™ 

2’ (ft-t-a)™ 

^^56. The sum of two positive quantities is constant and equal to zn. 
What is the chance that their product is less than 
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57. How many times must a man be allowed to toss a penny so that 
the odds may be 100 to i that he gets at least one head? 

58. A coin is tossed m t-n times {m>n). Prove that the chance of 
at least m consecutive heads appearing is ■ Find also the chance 
of a run of exactly m consecutive heads. 

. •59. If ten different things be distributed among three persons, show 
that the chance of a particular person having more than five of them 
19 6 8 a • 

60. If p be the chance that an odd number of aces turn up when n 
ordinary dice are thrown, show that i — 2/> = (f)". 

61. A pack of cards has been dealt in the usual way to four players. 
One player has just one ace ; prove that the chance that his partner has 
the other three aces is ^^5. 

62. A bag contains a certain number of balls some of which are white. 
I am to get a shilling for every ball so long as I continue to draw white 
only, the balls drawn not being replaced. An additional ball not being 
white is introduced and I claim as compensation to be allowed to replace 
every white ball that I draw. Is this fair? 

63. There are three sets of cards, red, yellow and blue. Each set 
contains ten cards, numbered i to 10. Three cards are drawn at random. 
Find the chance that the sum of the numbers on them equals 15: 

(1) if the cards are all to be drawn from the red set; 

(2) if one card is to be drawn from each set; 

(3) if the cards are to be drawn from the three sets mixed indis- 

criminately. 

64. From a bag containing ten red, ten white and ten blue balls one 
is to be drawn at random and replaced. The operation is to be repeated 
ten times. Find the chance that at least one ball of each colour will be 
drawn. 

65. Out of 3« consecutive integers three are selected at random. 
Find the chance that their sum is divisible by 3. 

66. In a book of values of a certain function there is one error on 
the average in every m values. Find the number of times, r, a value 
must be turned up at random in order that you may have an even chance 
of turning up an erroneous value. Show that when m becomes large 
the ratio rjm tends to a fixed quantity and find this quantity. 
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67. A, B, C and five other football teams enter for a competition. 
The teams are of equal skill and are drawn by lot in pairs before each 
round, the winners of the previous round entering the next round. 
Find the chance that in the course of the competition A will beat B, 
having first beaten C. 

68. A street consists of 24 houses numbered i to 24, odd numbers 
on one side, even on the other. Three houses are vacant. Assuming that 
the houses are all equally likely to be vacant, find the chances ; 

(1) that the three houses are next to each other; 

(2) that all three are on the same side of the street ; 

(3) that if they are all on the same side the sum of their numbers 

equals 42. 

69. A put 10 balls in all, some of which were white, some red, 
and some black, into a bag, and asked B and C, neither of whom 
knew the number of balls of each colour, each to make 1000 draws 
and note the colour of each ball drawn. B’& and C’s results were 
exactly the same, viz.: 

White 600 

Red 300 

Black 1 00 

It subsequently turned out that whereas B had returned each ball 
to the bag after noting its colour, C had kept the balls drawn in hand 
until he had drawn all ten balls and had then returned them to the 
bag and made another 10 draws, and so on. 

What difference does this make to the probabilities of drawing balls 
of different colour as inferred from the results of the experiment, and 
what in the circumstances was the probability that B and C would 
obtain the same result? 

70. Find the chance that the sum of the numbers on three cards 
drawn at random from an ordinary pack of 52 cards amounts to 21, all 
the court cards counting as 10. How will the result be altered if an ace 
can count as i or 1 1 ? 

71. If a coin be tossed 15 times, what is the probability of getting 
heads exactly as many times in the first 10 throws as in the last 5? 

72. A bag contains eight counters numbered i to 8. Four are drawn 
at random. Find the chances that 

(1) the sum of the numbers on the four coimters amounts to at 

least 17; 

(2) the counters numbered 2 and 3 are among the four; 

(3) the four counters contain at least two of the three numbers 

3. 5. 7. 
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73. If 6n tickets numbered o, i, 2, ... 6« — i are placed in a bag and 
3 are drawn out, find the chance that the sum of the numbers on them 
18 equal to 6n. 

74. From a bag containing nine red and nine blue balls nine are drawn 
at random, the balls being replaced. Show that the probability that four 
balls of each colour will be included is a little less than |. 

75. A looks at a clock at some time between 2 and 5 p.m., all times 
within the limits being equally likely. He looks again when it strikes 
the next quarter hour. What is the chance that in the meantime the 
minute has overtaken the hour hand? 

76. Four suits ot cards, each suit consisting of 13 cards numbered 
from I to 13, are dealt to four persons. Find the chance that each 
person’s cards contain all the numbers from i to 13. 

77. There are four sets of calculations on one sheet which have to 
be made, then checked and finally scrutinized. A and B can calculate 
only, C and D can calculate or check, and E and F can scrutinize only. 
No person may check a calculation he has made and all work must be 
signed. Find the chance that when the sheet is finally completed each 
name of the above six appears exactly twice. 

78. From an ordinary pack of cards a card is drawn and then six 
other cards at random. Find the chance that the card first drawn is the 
highest of Its suit amongst all the cards drawn 

79. A bag contains thirteen balls of which tour are white and nine 
black. If a ball be drawn r times successnely and replaced after each 
drawmg, show that the chance that no two successive drawings shall 
have given white balls is 

i6.i2’’- ( -3 )’' 

1513' 

80. The reserved seats m a certain section of a concert-hall are 
numbered consecutively from 1 to 100. A man sends for five consecutive 
tickets for one concert and for eight consecutive tickets for another. Find 
the chance that there will be no number common to both sets of tickets. 

81. In a cup draw tliere were four Southern and four Northern 
teams, the names being drawn one by one from a bag. Find the fol- 
lowing probabilities; 

(1) that each one of the four successive pairs consisted of one 

Southern and one Northern team ; 

(2) that each one of the four successive pairs consisted of two 

Southern or two Northern teams ; 

(3) that the first drawn in at least three of the pairs was a Southern 

team. 
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82. A number consists of seven digits whose sum is 59. Find the 
chance that it is divisible by ii. 

83. Two dice are thrown and one of the players will win (a) if the 
sum be 7 or ii, or (b) if the sum be 4, 5, 6, 8, 9 or to and the same 
sum reappears before 7. Find the player’s chance of success. 

84. A and B cast alternately with two dice. It is agreed that, on each 
failure to win, the prize money is to be reduced by 3 per cent, of its 
value at the previous attempt. A wins if he throws 6 before B throws 7, 
and B wins if he throws 7 before A throws 6. A starts first. Compare 
the values of the respective chances of A and B. 

85. A bag contains n counters marked i, 2, 3 ... n. If two counters 
are drawn show that the chance that the difference of the counters 
exceeds m (less than n- i) is 

(n — m)(n — m — i) 
n{n-i) 

Deduce from this result the chance that if two points are taken at 
random on a line the length between them exceeds half the length of 
the line. 

86. A bag contains two white balls and one black ball. A drawing 
of two balls is made. If either is black, the two are replaced and another 
black ball is added. A second drawing of two balls is then made, and 
again if either is black, the two are replaced and another black ball is 
added and so on. 

What is the chance that if the drawings are continued indefinitely 
two white balls will never be drawn together? 

87. A’a chance of scoring any point is f of B’a. A engages to score 
14 in excess of B before B shall have scored 3 in excess of A. Show that 
A’s chance of winning the match is equal to 

»-(tf 

88. A and his wife engage in a “mixed doubles” tennis tournament 
in which each pair of players consists of one member of each sex. There 
are fourteen other persons, seven of each sex, also entered for the 
tournament and players are drawn by lot before each round in such a 
way that any person of one sex may be the partner of any person of the 
other sex. Only the winners in any one round enter the next round. 
Assuming that all the players of each sex are equal in skill, find the 
probability that in the final round A and his wife play together as 
partners. 
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89. All that is known about a quadratic equation is that the coefficients 
are all different and are positive intep;ers less than 10. Find the chance 
that the roots of the equation are real, all the integral values of the 
coefficients satisfying the above conditions being equally likely, and zero 
values of the coefficients being excluded. 

90. A pack of cards is dealt in the usual way to four players of whom 
two and two are partners and the dealer turns up his last card. Denoting 
by the term “honours” the ace, king, queen, knave of the suit to which 
the turned-up card belongs, find the chance that each pair of partners 
shall have two honours. 

91. A number taken at random is squared. Find the chances that the 
following are even numbers : 

(1) the digit in the units place of the result; 

(2) the digit in the tens place of the result ; 

(3) the digit in the hundreds place of the result. 

92. Before commencing a game of cards four players cut for partners, 
i.e. the two highest play together and the two lowest together. All suits 
being of equal value, what is the chance that they will have to cut again? 

93. A and B play a series of games to be won by the player who first 
wins two consecutive games, ^’s chances of winning, losing or drawing 
any particular game are i, J and J respectively. Find B’a chance of 
winning the match (a) at the outset, (b) when he has just won one game, 
and (c) when he has just lost one game. 

94. A and B have equal chances of winning a single game. A wants 

n games and B n+ i games to win a match. Show that the odds on 

. , {zn ) ! 

A are i +p to i -p where ^ 

95. Three posts are filled one after another by lot from amongst ten 
persons {A, B, C, D, and six others), the first by any one of the ten, the 
second by anyone except A and B, the third by anyone except C and D. 
What is the chance that /I or 5 or both of them, and C or D or both of 
them, are chosen? No one can hold more than one post, and in drawing 
for the second and third posts the barred persons and any person 
previously chosen are excluded. 

96. With a hand of thirteen cards a player is known to hold one ace. 
What is the chance that he has at least one other ace? 

If it is known that the ace he holds is the ace of hearts what is the 
chance that he has at least one other ace? 
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ELEMENTARY STATISTICS 

1. It is not the object of this book to deal exhaustively with the 
science of statistics. We shall treat only with the elementary ideas 
of the subject, and this treatment should be sufficient to enable the 
student to solve the simpler types of problem that are met with in 
practical work. 

Statistical investigation covers a very wide field and is concerned 
with the collection, tabulation and summarization of data, the in- 
terpretation of the results and the deductions that can be made 
therefrom. Collection and tabulation must here be taken for 
granted and the other aspects will be considered to a limited degree 
only. 

2. Elementary definitions. 

Suppose that a number of men were selected at random and 
arranged in order of their heights, and suppose that these men 
formed a representative and unbiased sample of the particular class 
under review. We should be able by mere inspection to obtain some 
idea of the variable characteristic — in this case, height — of the class. 
For some purposes this might be sufficient, but, in general, this 
method, namely, the assembling of the men, is quite impracticable. 
Further, we should probably need records of the experiment, and 
it is essential therefore to arrange the work so that these records 
may be readily made. 

If we were to represent the men by straight lines proportional 
to the heights of the men, drawn at equal intervals at right angles to 
a fixed straight line, and retained the order of magnitude as before, 
we should be in possession of a permanent record of the heights. 
An arrangement such as this is termed an array and the individual 
values of the measured characteristic are called variates. As an 
illustration of this pictorial review of a particular characteristic we 
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may consider the following figure, which shows the heights of 
42 men, all of whom are under 5 ft. 9 in. For convenience of 
illustration only the excess of height over 5 ft. is given, and 
fractions of an inch are ignored. 



An examination of this array shows that there were 2 men whose 
iieight was 5 ft. 3 in., 2 whose height was 5 ft. 4 in., and so on. 
These numbers, which from the diagram are 2, 2, 2, 5, 12 and 19, 
are known as the frequencies with which the heights occurred. 

The full sample of which the above is an extract consisted of 100 
men for whom the table of frequencies is as under; 


Height of man to the 
nearest inch 

Number of 
cases 

5 fi. 3 in. 

2 

5 4 

2 

5 5 

2 

5 6 

5 

5 7 

12 

5 8 

19 

5 9 

16 

5 10 

16 

5 ” 

14 

6 0 

7 

6 I 

2 

6 2 

3 


100 
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These data can be arranged to advantage in an array, not of 
variates, but of frequencies, in either of the following forms : 



Fig. 7- 



Fig. 8. The block diagram in thia figure ia called a histogram. 


It should be noted that whereas Fig. 6 represents an array of 
heights with one scale only, namely height, Figs. 7 and 8 necessitate 
the adoption of two scales, height and frequency. In these figures 
we have, in fact, shown values of a variable (frequency) compared 
with those of the independent variable (height). 

3. Reference to the table on p. 255 will show that the figures in the 
first column are given to the nearest inch. It is obvious, however, 
that, for example, the 19 men whose heights were stated to be 
5 ft. 8 in. were not all exactly of this height. “Heights of 5 ft. 8 in. 
to the nearest inch ” means any height between the limits 5 ft. 7J in., 
to 5 ft. 8J in., and this interval of one inch is called the class- 
interval. The recorded height of 5 ft. 8 in. may be taken as the 
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mid-value of the class range. Data may be displayed either in 
the form given in the table on p. 255, or with the class range 
indicated by its lower limit, as shown in the following example 
relating to the age distribution of a body of lives: 



I’ icquency 

20-5- 

38 

''’• 1 - 5 - 

70 

22-5- 

82 

23-5- 

lOI 

245- 

128 

255- 

175 

26-5- 

iSi 

27-5- 

200 

28 - 5 - 

181 

29-5- 

190 

1 134^’ 


Here the class-interval is one vear, and it is often assumed for 
practical purposes that there wwre 38 cases of age 21, 70 of age 22, 
and so on. In many distributions it will be found convenient to 
use a larger class-interval than a unit interval ; e.xamples of various 
class-intervals will occur later in this chapter. 

A set of relative frequencies tabulated according to class ranges 
is called a frequency distribution. Given an ungrouped set of ob- 
sen’ations it is a matter for careful consideration as to the best form 
of grouping to adopt in order to obtain a reasonable frequency 
distribution from which to work. 

The following is an important point which may arise in the con- 
struction of a frequency distribution. Suppose that in c.xamining 
the data for a distribution such as that in the above table it was 
found that there were two persons exactly 2ii years old at the date 
when the observations were made. In such a case it is the practice 
to assign one of these to the group 20I-21I and the other to the 
group 21^-22^, since they may be said to belong equally to each 
of the groups that they divide. If there had been three of such 
cases, i^ would be assigned to the lowvr age-group and ij to the 
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upper group. It may happen therefore that although we are dealing 
with integral numbers of cases the distribution may show fractional 
frequencies when the table is completed. 

4. Averages. 

It will be convenient at this stage to revert to a pictorial repre- 
sentation of an array of variates as exemplified in paragraph 2. In 
the following example a small sample only has been taken in order 
that the deductions that may be drawn from the figure may be 
clearly shown. 

The data relate to the ages of a class of 25 students and are given 
in the following table : 


Nearest age 

Number ot 
stuJcnts 

18 

X 

19 

2 

20 

I 

21 

6 

22 

3 

23 

4 

24 

I 

25 

1 

26 

1 

27 

I 

28 

4 


25 


The array of variates is 



Fig. 9. 



AVERAGES 


259 

There are 25 variates depicted in this diagram in order of magni- 
tude, and since there happens to be an odd number, we can at once 
single out the middle variate. The middle variate corresponds to an 
age 22 and is a particular variate of our sample. The statistical name 
for the middle variate is the median, so that the median, or to be 
more precise the median age, for this distribution is 22 years. If 
therefore a number of such experiments were made with different 
samples of similar students we should be in a position to compare 
one average characteristic of each sample, namely the median age. 

If we examine the diagram and table again, we see that the 
variates which occur with the greatest frequency are those repre- 
senting the number of students at age 21. We have found the mode, 
or most frequent value, and this again is another average. In this 
case it happens that the mode is immediately apparent from the 
table giving the frequency distribution, although the median can- 
not be deduced without further investigation. 

Now an average may be defined as a quantity which typifies the 
magnitude of the set of variates from which it has been ascertained, 
and it may occur to the reader that the simplest average that could 
be found from the given data would be the ordinary arithmetic 
mean or, if we were using a frequency table, the weighted mean. 
This is so, and by simple calculation it is evident that the mean is 
22-8 years. 

We thus have three averages: 

Mean ... ... 22-8 years 

Median ... ... 22 years 

Mode ... ... 21 years 

and it remains for us to determine which of these averages is the 
most satisfactory for statistical purposes and, in addition, what use 
we can make of them. 

It should be noted that frequency distributions may be illustrated 
by curves drawn through the peaks of the ordinates of a frequency 
diagram such as that in Fig. 7. While the positions of the mean, 
mode and median are often represented on frequency curves by 
ordinates, it is really not the ordinates but the corresponding 


17-2 



ELEMENTARY STATISTICS 


260 

abscissae which give the values of the mean, etc. Thus, in the fre- 
quency curve in Fig. 10 below, the values of the three averages 
are XA, XB and XC and not the ordinates through A, B and C. 


A B 


— mode j 

1 

medz/xn, ^ 

1 

1 

mean 

1 


Fig. 10. 


5. It is not immediately apparent which particular kind of average 
is most suitable for any group of observations, and it requires a con- 
siderable amount of experience before a student can use these 
averages to their best advantage. There are, however, criteria which 
enable us to select the most satisfactory form of average for general 
adoption. The principal criteria may be stated as follows: 

A satisfactory average should be 
{a) rigidly defined; 

(6) based on unbiased observations as numerous as possible; 
(c) relatively stable with fluctuations of sampling; i.e., though 
different samples might produce slightly different re- 
sults, the variation from sample to sample should be as 
small as possible; 

{d) capable of easy calculation and simple algebraic treatment. 

Let us consider the example in paragraph 4. If it had happened 
that our data included no cases over age 27 — i.e. the four cases at age 
28 were non-existent — the mean age of the 21 cases observed would 
be about one year younger than before— about 21-8 years — while the 
median and the mode would remain unaltered. It appears from 
this simple example that a few extreme cases more or less may 
considerably affect the mean, while the median and mode are un- 
changed. But this possible disadvantage of the mean disappears 
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when larger distributions are under consideration. In point of 
fact, the mean possesses several advantages over the median and 
the mode. It is rigidly defined, while the median is somewhat 
indefinite. Again, the mean can easily be calculated and lends itself 
to simple algebraical treatment, while, as will be seen later, it is 
generally impossible to obtain more than estimates of the other two 
averages. For these reasons the mean is the average that is used to 
a very great extent in statistical calculations. 

6. Calculation of the mean. 

Let n be the number of observations, x, the value of the variate 
in the rth class-interval and the frequency in the rth class- 
interval, so that 'Zf^ = n. 

Then M, the mean, is simply S or - S (Lx,.), where 

n 

the summation extends over all the class-intervals. This evidently 
gives a weighted mean value of x^. 

The calculation of jT/can obviously be effected from this formula 
whatever the size of the distribution and wherever in the range the 
mean happens to be. If however the values of x are large (such 
as measurements in a small unit) and the frequencies are also 
numerous, a change of unit and origin will simplify the arithmetic 
considerably. 

Thus, if a is the value of the temporary origin and the 

deviation of from a, then 

X,. = a + . 

Now M=l'^{f,x,) 

n 

.= ;jaS/, + ^(/-A)] 

= ^*Jar, + S(/A)] 

= «+^'i(/A), 
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and if a be suitably chosen, i.e. so that the deviations of x from a 
are small and of different signs, it is evident that the calculation of 
2 is far less troublesome than that of S {/fX^). 

Example 1. 

Calculate the mean age of wife corresponding to the age-group bo- 
ot husband from the 1921 Census of married couples. (See Chap. X, 
paragraph 10.) 

The first point to note is that the class-interval is five years, so that 
the age of wife must be assumed to be the mid-value of the range. The 
data have been tabulated and a glance at the figures suggests that a 
suitable origin will be the mid-value of the group 55-. The method of 
calculation in the table is self-explanatory. 




Deviation from 

1 

Age (mid-value of 

Frequency 

origin in terms of 

/s ! 

class range) 

/ 

class-interval o 
as unit 

27-5 

I 

-6 

- 6 

32-5 

2 

-5 

- 10 

37-5 

4 

-4 

- 16 

42-5 

10 

-3 

- 30 

47 - S 

21 

— 2 

- 42 

52-5 

53 

“ I 

..... - 53 

57-5 

126 

0 

-157 

625 

163 

+ 1 

-1-163 

67-5 

35 

+ 2 

-t- 70 

72-5 

6 

+ 3 

-f 18 

77'5 

I 

+ 4 

+ 4 


422 


+ 255 


Thence 2 (/S) = 255 - 157 

= 98 class-intervals 
= 490 years, 

^ 2 (yS) = 490/422= I I 5 .... 

iW = 57-5 + J-i5 ■••• 

= 587 years (say). 


so that 
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An alternative method of obtaining the arithmetic mean, if the 
class-interval be taken as the unit, is based on a single and double 
summation of the column of frequencies. Thus, if the origin is 
taken at one value lower than the lowest value given, the initial 
value of the variable may be denoted by i, the second by 2 and 
so on. 

A first summation gives '^fr=fi+h+h+ +/n-2+A-i+/n- 

Summing again from the bottom upwards, 

2 (^/r)= 1/1 + 2/2 + 3/3+ ... +(«-2)A_2 + (w-i)/„_i + n/„ 

r“>n 

= S rf,- 

r-l 


If therefore the original initial value is M, so that the origin is 
at k—i, the mean value is 


k 



An example will make this clear. 


Example 2. 

Find the mean height of the sample of men from the table in para, 
graph 2. 

The lowest value of the height is 63 in. Take the origin at 62 in. 


Heifrht in 
inches 

Deviation from 
origin 

/ 


63 

1 

2 

100 

64 

2 

2 

98 

65 

3 

2 

96 

66 

4 

5 

94 

67 

5 

12 

89 

68 

6 

19 

77 

69 

7 

16 

58 

70 

8 

16 

42 

71 

9 

14 

26 

72 

10 

7 

12 

73 

II 

2 

5 

74 

12 

3 

3 



100 

700 
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Then the mean height = 62 + 700/100 = 69 in. 

In a large group it is often simpler to take the origin at k and the 
second summation up to the second line from the top. Thus, ki this 
example, the mean height = 63 + 600/100 = 69 in. 

This method is convenient in actuarial calculations where it is 
necessary to find the average age of a distribution consisting of 
comparatively large numbers. 


7 . Calculadon of the medlaiL 


The median is the middle variate of the array, and if we were to 
construct a diagram showing the array of the variates of the ob- 
servations in, say, the example above, it would be possible to select 
the middle variate. In practice it is generally impracticable to 
construct such a diagram, and a reasonable approximation obtained 
from a table of frequencies is, as a rule, sufficient for most purposes. 

Consider, for example, the table in Example i. There are 422 
cases, so that the median will lie betw'cen the 211th and 212th cases. 
Now from the frequency distribution it is seen that up to and in- 
cluding the class range 50-55 there are 91 observations, while if 
the next group be included there are 217 obsen'ations. The median 
is therefore in this last group of 126 cases. It is reasonable to 
assume that the first observation in this group has the value of the 
lower boundary of the class range, the last observation has the 
value of the upper boundary and the 126 observations increase 
uniformly within the range. Since we require the age correspond- 
ing to a hypothetical case between the 211th and 212th cases, simple 
proportion will give the median to be 


55 + 


5 (211-5 -91) 

126 ’ 


the class-interv'al being 5 years. 

Then the median =55 + ^^:^ 

126 


= 55 + 4-78 

= 59-8 years (say). 
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8. Calculation of the mode. 

The mode is the value which occurs with the greatest frequency. 
Although in a case where the variates proceed by unit variation it 
may be possible to select at sight the most likely value, this is not 
practicable where the distribution is one which concerns a con- 
tinuous variable. Thus in Example i all that can be said is that the 
mode lies in a five-yearly group of which 62-5 is the mid-interval, 
although an examination of the table might suggest that the re- 
quired value is nearer the lower limit than the upper one. 

In many cases a good approximation to the mode can be found 
from the following simple formula: 

Let M, A/,, and M„ represent the values of the mean, median and 
mode respectively. 

Then A/„ = A/-3 (A/-A/,). 

In Example i the value of the mode according to this formula is 
^^o = 5S7-3(587-5Q-8) 

= .‘i^57 + 3-3 
= 62 0 years. 

It should be emphasized that this relation though often vciy' close 
is not exact, and is not applicable to all distributions. Other 
methods have to be adopted in extreme cases, although even then 
a reasonable approximation mav often be obtained by drawing a 
smooth curve through the statistics. 

9. Ogive curve. 

When a rough estimate of the median is required it is often 
u.seful to use a grapliical method. This method is best illustrated 
by an actual example. 

If we take the data from the table on p. 2 (>3 we may write them 
down in an alternative form, thus: 
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Height of not more 
than X inches 
* 

No. of cases 

63 

2 

64 

4 

65 

6 

66 

II 

67 

23 

68 

42 

69 

S8 

70 

74 

V 

88 

72 

95 

73 

97 

74 

100 


Then, if the numbers so found be plotted against the heights and 
the points joined by a smooth curve, we have an ogive curve. 
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An approximate value of the median will be the abscissa corre- 
sponding to the ordinate midway between the 50th and 51st cases, 
i.e. about 68^ inches. 

The mode can also be estimated from the ogive, its position 
being where the slope of the curve is steepest. In many cases 
however the shape of the curve is such that all that can be said is 
that the mode lies betv^'een certain comparatively wide limits, and 
in those circumstances no better approximation can be obtained 
from the curve than is evident from a consideration of the statistics 
in their original tabular form. 

10. Probable value and expectation. 

Before we proceed further with the discussion of the mode or 
most likely value, it will be convenient to refer to another aspect of 
the mean. 

Suppose that there are n counters in a bag, each counter having 
a value in shillings. Then, if we are given the number of counters of 
each particular value it is a matter of simple arithmetic to find the 
average value of a counter. All that we need do is to weight the 
number of counters having a particular value by that value, add the 
products and divide by the number of counters. 

Example 3. 

Find the weighted mean of the following distribution : 

No. of counters (x) 9 1 2 5 7 

Value of counter (y) 12345 

in shillings 

Total value of counters =I!.ry 

= 9-)-2-(-6-t- 20 + 35 

= 72. 

Total number of counters = Xx = 24, 

Weighted mean =Xxv/ilx — 3 shillings. 

We may look upon this problem from another point of view. 
Suppose that we use the word prize to denote the value of a quan- 
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tity to be attained. In general, the attainment of a prize is con- 
tingent upon the happening of some event and may therefore not 
always be secured. If the amount of the prize is S and it is obtained 
in pN trials out of a total of N in which the opportunity of obtaining 
the prize can occur, then the probability of its being obtained is p. 
Also, the total value of all the prizes is SN and the average value is 
pS. This average value is called the expected value or probable value. 
If we are concerned with the person who is playing for the prize 
the probable value of the prize is called his expectation. In other 
words, expectation is the product of an expected gain in actual value 
and the mathematical probability of obtaining such a gain. It may 
be noted that the gain or prize is not necessarily restricted to 
financial gain, but may be a number of years of life or a number of 
heads in a series of throws. 

Let us consider the problem above in the light of this definition, 
and calculate the expectation of a man who is allowed to draw one 
counter at random from the bag. 

The chance that he draws a counter value one shilling = t/j ; 
the chance that he draws a counter value two shillings = j , 
and so on. 

His expectation, which is the same as the probable value of his 
draw, is, therefore, 

(/s) I + (^) 2 + (-a""*) 3 + (^'x) 4 + (/x) 5 

_9-i-2 + 6 + 2o + 35 
24 

= 3 shillings, as before. 

It is seen from this example that the diifercnce between weighted 
mean and expectation lies solely in the order of the operations re- 
quired to obtain the result. It is sometimes more convenient to 
treat such a question as one in which the mean is to be calculated, 
while at other times the second order should be adopted. Thus, in 
the following questions, the expectation is most easily calculated 
by introducing the chances of success and failure. 
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Example 4. 

A player throwing an ordinary die is to receive where n is the 

number of throws that he takes to throw the first 6. Find his expectation. 
His expectation 



1 5 I 

2 6 6 






,12' 7 J 



Example 5 . 

A table is divided into six squares numbered i to 6. A player places 
a coin on a certain square. Three dice are thrown. If the number thus 
backed appears once, twice or thrice, the player receives back his own 
coin together with one, two or three others respectively of the same 
value. In any other event he loses his sta!;e. Does the advantage in the 
long run lie with the player or the “banker”? 

(a) The chance that no die shows the number backed = f|)^. 


(ft) 

,, one die shows „ 

= 3 (ii)- i 

(r) 

,, two dice show ,, 

= 3 HiY- 

{d) 

,, three dice show 



The net expectation of the player 

= (/;) + 2 (c) + 3 (rf) -((/)== -ayff. 

and that of the lianker 

- -(A)-2(c)- 3 (</) + (a) = .^YTr. 

so that the advantage lies with the banker in the long run. 

For a stake of a shilling it is easily seen that this advantage is just 
under a penny. 

Here it might he contended that as the player is to receive back his 
own coin in addition to the prize, his expectation should be based on 
respective receipts of two, three or four units instead of one, two or 
three units, as appears in the above solution. His expectation on this 
basis might be argued as being 2 (ft) + 3 (c) + 4 (i/)-(a) or > which 
would show a substantial advantage to the player. Further consideration 
would show, however, that if the return of the stake is treated as a 
profit, then the laving of the stake must he treated as a payment to the 
banker for the privilege of playing and that it is, accordingly, definitely 
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paid away whether the player wins or loses. His expectation then be- 
comes 2 (J) + 3 (c) + 4 (<0 — I = “2*3^ ^ before. 

11 . In connection with mathematical expectation it is interesting 
to note the celebrated St Petersburg problem, which has been a 
fruitful source of discussion for nearly two hundred years. Briefly 
stated, the problem is this: A coin is tossed until head turns up. 
If head turns up first A is to pay B one unit ; if head does not turn 
up till the second throw B is to receive two units ; if not until the 
third throw four units, and so on. How much must B pay A 
before the game, in order that the game may be considered fair? 
That is, what is B'& expectation? 

The theoretical solution is this; 

At the first trial B’s expectation is i x i = i. 

,, second „ „ (1)^x2 =i. 


„ nth „ „ = 

B’s expectation is therefore i + | + ... to infinity, and as 

this is a divergent series, it appears that B could afford to pay an 
infinitely large sum before the game starts for his expectation. 

Many explanations of this result have been given by eminent mathe- 
maticians, notably by d’Alembert, Bernoiiilli and de Morgan. An inter- 
esting account of the problem with an alternative solution depending 
upon the amount of money that B possesses at the outset is given by 
Whitworth in his Choice and Chance. This solution depends on a 
somewhat arbitrary assumption, and a more practical limitation of D’s 
expectation arises from the fact that ^’s resources, however great, must 
be limited. Poincare {Calcul des Probability, p. 42) shows that if A ’s total 
assets are 2*’, B’s expectation is i +ip. For example, if A possesses 2®“ 
— which is more than a thousand millions — B’s expectation is reduced 
from infinity to 16, and as Poincare drily remarks, this is a considerable 
reduction ! 

12. It is essential to distinguish between the absolute probability 
of an event, the average number of times that it may happen over 
a series of trials and the most probable number of times that it will 
occur. The most probable value (or mode, in a statistical distribution) 
is the value that occurs with greatest frequency, and in simple 
examples this value can be easily determined by considering 
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separately the contingency of each event. Thus in a single throw 
with two dice the numbers that may turn up and the chances of 
occurrence of these throws are given in the following table ; 

Possible numbers: 23456789 10 ii 12 

Chances of occurrence : 12345654 3 2 i (-^36) 

The most probable number to be thrown is therefore 7. 

Again, in Example 7 on p. 272, a hasty conclusion would be that 
half the pack (i.e. 26 cards) would have to be turned up before 
two aces out of the four would appear. This is not so, however. 

The four aces divide the remaining 48 cards into 5 groups. If a 

large number of such divisions were made the average number in 
each of the 5 groups would be The average number of cards 
which must be turned up before two aces appear is therefore 2 of 
these groups plus 2 aces, i.e. -*5^ + 2 or 21-2, as on p. 272. The 
solution there given shows that 26 is neither the average number nor 
the most probable number, and, further, that there is a definite 
probability associated with each number of cards turned up. At 
the risk of labouring the point, it must be emphasized that these 
separate probabilities are only to be realized over a long series of 
trials. 

Example 6. 

A purse contains two half-crowns and three shillings ; a second purse 
one half-crown and four shillings. A coin is drawn from the first and 
placed in the second, and then a coin is drawn from the second and 
placed in the first. Assuming that the chance of drawing a half-crown 
is twice that of drawing a shilling, find the most probable value of the 
coins in the first purse after the second operation. 

At the end of the second operation the first purse may contain : 

(i) 2 half-crowns and 3 shillings; or 
(u) 3 .. .. i 

(iii) I half-crown ,. 4 „ 

The respective chances are found thus: 

(i) In order to achieve this result, a coin of the same value must 
be taken from the second purse at the second draw as was placed in this 
purse as the result of the first draw. 

Since the chance of drawing a half-crown is twice that of drawing 
a shilling, the chance that a half-crown is drawn from the first purse 
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originally is The second purse will then contain 2 half-crowns and 
4 shillings, and the chance of drawing a half-crown from this purse is 
The total chance that a half-crown is drawn on both occasions is 
therefore . 

Similarly, the chance that a shilling is drawn both times is y . 

The total chance that the coins in the first purse have the same value 
at the end of the two operations is therefore 

4 4,3 r, _ 2 9 

t-gty .7— -49. 

By proceeding similarly, it can easily be shown that the respective 
chances under headings (ii) and (iii) are -£‘g and 7 . 

The greatest of these values is | jj . 

The most probable value of the purse after the second draw is the 
value associated with this fraction, namely S shillings. 

Note that the probable value of the coins in the purse at the end of 
the second operation (i.e. the expectation of a person drawing the coins) is 

[(|B'xS)-(-(/jXQl) + (|-x6i)] shillings, or yfl shillings. 

In other tvords, it would be worth w'hile to give about ys. qd. for the 
purse after the second draw. 

Example 7. 

Cards are dealt one by one from an ordinary pack (without replace- 
ments) until two aces have appeared. Find (i) how many cards (on the 
average) will be turned up, (ii) the most probable number of cards to be 
turned up. 

(i) Since two aces cannot appear until the second trial at the earliest, 
we have 

Chance of success at the second trial = 
third „ 

.. fourth „ - 3 ■'6? ti to • A 


and generally 

r . , • , 1) (“iz-*) (^i - j:).4.3 

Chance of success at the wth trial = . 

52.51.50.49 

The average number of cards to be turned up wdll be the sum of the 
series whose rth term is 

.r ((x- 1) (52-^) (51 -*) — ^ — y 

( '52.51.50.49) 

for all values of x from 2 to 50 (since there must be two aces at least in 
the first 50 cards). 
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Summing this by ordinary algebraic or finite difference methods, it 
is found that the average number of cards to be turned up is 21-2. 

(ii) The most probable number to be turned up will be the value 
of .-e which gives 

( x - i )(52- a :) (51 - a ;) 4.3 
52.51.50.49 

its greatest value. 

The value for x = r> the value for jc = r — i so long as 

(r - l) (52 - r) (5 1 - r) > (r - 2) (53 - r) (52 - r), 

i.e. so long as -r^ + 52r-5i > -r2 + 55r- 106, 
i.e. so long as 55>3''> 

i.e. r is not greater than 18 (since r must be integral). 

The most probable number of cards to be turned up is therefore 18. 


EXAMPLES II 

I . Find the mean age of the persons in the following table ; 


Exact age 

20 

21 

22 23 

24 

25 26 

27 

28 

29 

30 i 

Number of 
persons 

43 

68 

120 150 

LSO 

130 80 

90 

95 

81 

70 

1 

1 


2. Calculate the mean and median in respect of the following 
distribution : 


Length in 
inches 

No. of cases 

Length in 
inches 

No. of cases 

57-58 

2 

65-66 

641 

58-59 

4 

66-67 

532 

59-60 

50 

67-68 

316 

60-61 

100 

68-69 

124 

61-62 

169 

69-70 

72 

62-63 

543 

70-71 

20 

63-64 

762 

71-72 

5 

64-65 

750 

72-73 

1 


3. The infant mortality rates for a certain year in the aggregates of 
the urban districts of the 61 administrative counties of England and 
Wales were as follows : 

30, loi, 100, 53, 57, 72, 87, 48, 63, 70, 49, 58, 79, 94, 50, 98, 69, 56, 
100, 55, 71, 84, 71, 95, 114, 85, 61, 108, 64, 76, 99, 63, 68, 97, 80, 68, 


F M A S 


18 





274 ELEMENTARY STATISTICS 

90. 75 . 65, III, 90. 66, 65, 85, 84, 82, 96, 99, 67, 71, 71, 91, 102, 80, 
89, 113, 74, 87, 79, 88, 73. 

Discuss methods of setting out more clearly the information conveyed 
by these figures. Plot a frequency diagram and determine graphically 
the mode. 

4. The following table shows the marks obtained by 100 candidates 
in an examination. Calculate the mean number of marks, and ascertain 
as accurately as you can the position of the median; 


Marks obtained 

Number of 
candidates 

1-5 

1 

6-10 

2 

ii-iS 

6 

16-20 

10 

21-25 

II 

26-30 

15 

31-35 

16 

36-40 

15 

41-45 

9 

46-50 

7 

51-55 

4 

56-60 

3 

61-65 

I 

100 


5. Estimate the mode from the data in Question 4. 

How would your estimate be affected if you were given in addition the 
following information? 


Marks obtained 

Number of 
candidates 

25 

3 

26 

2 

27 

I 

28 

6 

29 

3 

30 

3 

31 

4 

32 

4 
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6. Find, by drawing a frequency curve, the approximate values of 
the median and mode from the given data ; 


Value 

•5 i-s 2-5 3-5 4-5 5-5 6-5 7 5 8-5 9-5 10-5 11-5 

Frequency 

II 17 26 38 56 83 120 163 196 181 93 16 


How would you check your results? 

7. The following are the death rates per thousand per annum of two 
towns in a certain year: 



Town A 


Town B 


Ages 



Death 



Death 


Population 

Deaths 

I ate per 
thousand 

Population 

Deaths 

rate per 
thousand 

0-2 

3 .000 

192 

64-0 

5,000 

300 

6o-o 

2-10 

10,000 

70 

7-0 

12,000 

78 

6-5 

10-20 

10,000 

40 

4-0 

10,000 

38 

3-8 

20-60 

32,500 

260 

8*0 

25,000 

190 

7-6 

60 + 

8,500 

Sio 

6o-o 

8,000 

460 

57’5 

All 

1 

64,000 

1072 

1675 

60,000 

1066 

1 

17-77 


In each group the death rate ofJFown A is greater than that of 
Towti B, but the reverse is the case when all ages are grouped together. 
Explain clearly why this is. 

8. The exports of a certain country during a period of 22 years were 
as follows, in millions : 


Year 

Exports 

Year 

h-xports 

Year 

F^xports 

1904 

200 

1912 

205 

1919 

207 

1905 

198 

1913 

20s 

1920 

208 

1906 

197 

1914 

202 

1921 

200 

1907 

201 

1915 

199 

1922 

195 

1908 

202 

1916 

195 

1923 

193 

1909 

200 

1917 

197 

1924 

200 

1910 

195 

1918 

203 

1925 

208 

I9II 

199 

— 

— 


— 
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A writes to the paper to point out that trade is diminishing and gives 
the following figures : 

Average exports 1909-11 £198 

1915-17 £197 

1921-23 £196 

B replies that it is increasing, and gives the following figures: 

Average exports 1906—8 £200 

1912-14 £204 

1918-20 £206 

Write a short letter commenting upon the letters and pointing out the 
true significance of the figures. 

9. What is the difference, if any, between weighted mean and expecta- 
tion} Illustrate your answer by an example. 

10. Define carefully probable value and most probable value. 

A purse contains four half-crowns, three pennies and two shillings. 
Four coins are drawn at random. How many different sums can these 
amount to, and what is the most probable sum? (Assume that any one 
coin is as likely to be drawn as any other.) 

11. A coin is tossed until both head and tail have appeared twice. 

(1) On the average how many times will the coin have to be tossed? 

(2) What is the most likely number of throws? 

^ (3) How many throws must a man be allowed if the odds in favour 
of success are to be 7 : i ? 

12. Two persons throw an ordinary die alternately, and the first who 
throws 6 is to receive eleven shillings ; find their expectations. 

13. There are eleven tickets in a bag numbered I, 2, 3, ... 1 1. A man 
draws two tickets together at random and is to receive a number of 
shillings equal to the product of the numbers he draws ; find the value 
of his expectation. 

14. A bag contains eighteen exactly similar counters. Ten counters 
are each of value a, four are each of value b and the remaining four are 
each of value 26. A man draws two counters at random. It is equally 
likely that any counter will be drawn. Find the ratio of a to b, if the 
value of the man’s expectation is to be 40/3. 

15. Each of two bags contains w sovereigns and « shillings. If a man 
draws a coin out of each bag, is he more or less likely to draw two 
sovereigns than if all the coins were in one bag and he drew two 
coins? 
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‘ 16. Purse A contains six shillings and two sovereigns, purse B seven 
shillings and one sovereign. Seven coins are transferred from A to B 
and then seven coins are transferred from B to A. Which purse is now 
likely to be the more valuable? 

17. A bag contains thirteen counters marked with the squares of the 
first thirteen natural numbers respectively. 

(1) A man draws a counter and is to receive the number of shillings 

equivalent to the number on the counter. Find his expectation. 

(2) If the man is allowed to draw three counters and to reject the 

highest and lowest, find his expectation. 

18. A purse contains five half-crowns and four shillings. A pays 
55. 3<f. for the right to receive the value of three coins drawn at random. 
Criticize his bargain, and find the chance that, after two attempts, the 
second on the same terms as the first, he will be a winner. 

19. A bag contains m white balls and two red balls. The balls are 
drawn from the bag one at a time without being replaced until a red ball 
is drawn. If i, 2, 3, ... w'hite balls are drawn, A is to receive 1^, 2^, 3“, ... 
shillings respectively. Find his expectation. 

20. A bag contains twenty shillings and three sovereigns. Coins are 
drawn in succession, one at a time, w'ithout being replaced, until two 
sovereigns have been drawn. What is the probable number of shillings 
left in the hag? 

2 1 . Tiiere are ten counters in a bag marked w'ith consecutive numbers. 
Two counters are drawn from the bag. If the sum of the numbers 
drawn is odd, a man is to receive that number of shillings; if it is even, 
he is to pay that number of shillings. Find the man’s expectation 

(1) if the counters are marked from o to 9; 

(2) if they are marked from i to 10; 

(3) if they are marked from 2 to ii. 

22. A bag contains a coin of value M, and a number of other coins 
whose aggregate value is m. A person draws one at a time till he draws 
the coin of value M. .Assuming it is equally likely that any particular 
coin is drawn, find the value of his expectation. 

23. A bag contains twenty white balls numbered i to 20 and ten 
unnumbered red balls. A ball is drawn at random and replaced, six 
times. Find the probability that 

(1) at least three white balls are drawn; 

(2) three white and three red balls are drawn ; 

(3) three red balls and Nos. i, 2, 3 of the white balls are drawn. 

What is the most probable number of white balls drawn, and what is 

the probability that this number is drawn? 
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24. A man throws a six-faced die until he gets an ace. He is to receive 
if he succeeds at the first throw, jor. if he succeeds at the second 

throw, 6i. 8d. if he succeeds at the third throw and so on. 

Given log 2 = -3010300, 

log 3 = ‘477 1213. 

. log 2-718282 = -4342945, 

find the value of his expectation to the nearest penny. 

25. A and B have each two ordinary cubical dice, the faces being 
numbered from i to 6 on each die. They throw simultaneously until the 
whole of a stake of ^100 has been divided on the following conditions; 

(i) If A throws 7 before B has thrown 2, A is to take the balance 

of the stake then remaining. 

(ii) If B throws 2 before A has thrown 7, B is to take the balance 

of the stake then remaining. 

(iii) If simultaneously A throws 7 and B throws 2 the balance then 

remaining is to be divided equally between A and B. 

(iv) If at the nth throw neither 7 nor 2 has yet been thrown by A 

or B respectively, B is to take i/n of the balance then re- 
maining, except that this additional benefit to B docs not 
operate at the first throw. 

Find to the nearest penny the respective expectations of A and B. 
Given log 6 = -7781513, log 41 = 1-6127839, log c = -4342945. 



CHAPTER XII 


FURTHER PROPOSITIONS IN 
STATISTICS 

1 . In the preceding chapter we have defined three different 
averages of a distribution, and any one of these three averages may 
be used to typify the whole distribution. Thus, the mean gives an 
indication of the magnitude of the variates, the median tells us the 
position of the middle variate and the mode gives us the most 
frequent value. But these averages tell us very little about the dis- 
tribution itself. For example, if the sums assured under whole-life 
policies were in question and we were given a random sample of 
loo such policies, the mean of the sums assured might be £276, the 
median ^(^265 and the mode £250, From the mean, all that we 
could gather would be that the average sum assured was out 
of a total of ^27,600; from the median that there were as many 
policies with sums assured less than £265 as there were policies 
with sums assured greater than this amount; and from the mode 
that there were more policies for £250 than for any other individual 
amount. What we require to know in addition is the extent to which 
the individual sums assured vary from the particular average. The 
statistical term for this variation is the dispersion of the group. 

2. The range. 

The range of the distribution — i.e. the difference between the 
extreme variates — is a simple measure of dispersion. It would help 
us to some extent if we knew that in the example above the lowest 
sum assured was j^ioo and the highest £500; we should then know 
that the range w'as ;{j400. This measure is however not of very much 
value, for it would not add appreciably to our knowledge of tlie 
distribution if it happened that there was only one policy for £100 
and only one for ^^500. Another sample which gave the same 
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figures for the three averages might well have a smaller range, say 
from £150 to £450, notwithstanding that the most important 
frequencies, at the centre of the sample, were not very different 
from those at the centre of the first distribution. It is essential 
therefore that a more accurate and stable measure of dispersion be 
chosen. The range does not depend upon all the variates and the 
measure that is likely to be most useful is one which, as far as 
possible, does satisfy this condition. 


3. Mean deviation. 

If any particular average were taken and the deviations of the 
variates from this average were set down, it is obvious that the 
signs of some of the deviations would be positive and others 
negative. It might happen therefore that the sums of the deviations 
would be a small positive or negative quantity, or, in certain cases, 
zero, notwithstanding that the individual deviations were con- 
siderable. Consequently, if regard were had to sign, the algebraic 
sum of the deviations of the variates from, say, the mean would be 
of very little help as a measure of dispersion. As a result, in finding 
the total deviations from an average we ignore the signs of the 
individual deviations and treat them all as positive. We can then 
find the mean deviation, irrespective of sign. 

The mean is the average which lends itself most easily to arith- 
metical and algebraic treatment, and, as a rule, the mean is chosen 
as the origin from which to calculate the mean deviation. The mean 
deviation may, however, be found from any arbitrary origin. The 
following example gives a simple method for the calculation of the 
mean deviation from the median. 

Example 1 . 

Calculate the mean deviation from the median of the distribution in 
Example i of Chapter XI. 

The median is 59-8 years. Take the same origin as before (57 5) and 
assume that all the variates are concentrated at the mid-point of the 
group in which they lie. (See Note on p. 281.) 
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M^ — a = 59-8 — 57 S = 2'3 years = -46 class-intervals 


r 

Age 

Frequency 

/ 

Deviation from the 
median in terms of 
class-interval 
& as unit 

P > 

27-5 

1 

6-46 

6-46 

32-5 

2 

5-46 

10*92 

37-5 

4 

4-46 

17-84 

42-5 

10 

3-46 

34-60 

47-5 

21 

2-46 

51-66 

52-5 

53 

1-46 

77'38 

57-5 

126 

•46 

57-96 

62-5 

163 

■54 

88-02 

67-5 

35 

1-54 

53-90 

72-5 

6 

2-54 

15-24 

77-5 

1 

3-54 

3-54 

- 

422 


417-52 


Mean deviation from the median = - - = -9894 . . . class-intervals 

422 ^ 

= 4-95 ... years. 

Note. Strictly speaking, the assumption that all the variates are con- 
centrated at the mid-points of their respective groups should not be 
made for the group in which the origin lies. In this example, however, 
since the median is almost at one end of a group the result given abot'e 
is virtually correct. The extreme cases arise when the mean deviation 
is required from the middle of a large group. In that event special 
treatment is necessary for this group. 

4. Root-mean-square and standard deviations. 

The difficulty that some deviations are positive and some 
negative would be obviated if the deviations were squared. Then, 
by summing the squares, dividing by the total number of observa- 
tions, and taking the square root of the result, a form of mean 
de^'iation w'ould be obtained which would be independent of the 
signs of the individual deviations. This measure of dispersion is 
called the root-mean-square deviation. We shall denote this by the 
abbreviation r.m.s. deviation. 

If s be the r.m.s. deviation and the deviation of the frequency 

from the origin, then 
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The r.m.s. deviation possesses all the qualities of a good measure 
of dispersion; it is based on all the data, it lends itself to algebraic 
treatment and, in general, it is less affected by fluctuations of 
sampling than other measures. 

It has been shown that to calculate the mean without the use of 
a temporary origin may involve a considerable amount of arith- 
metic. This is even more marked in the case of the r.m.s. deviation. 
A formula similar to that obtained for tlie calculation of the mean 
from a temporary origin can, however, be found for the easier 
calculation of the r.m.s. deviation. 

Let a be a temporary origin and d be the deviation of the mean 
M from this origin. 

Let (J, be the deviation of observation from a so that 


S^ — X^-a^Xf+d, 

where is the de\nation of X^ from the mean. 

Then + 2f^4 +/r 

4 - 2dZf^r + dP'Vr . 

Now llfrXr is the sum of the deviations, positive and negative, 
from the mean, and is therefore zero. (See paragraph 6.) 


i.e. 


or 

where 




^fr 


^fr 






In order therefore to find r we may calculate the sum of the squares 
of the deviations from an arbitrary origin, so long as we know the 
deviation of the mean from that origin. 

If now the mean is taken as the origin, then d=o, and the r.m.s. 
deviation is a minimum. In these circumstances, the r.m.s. deviation 
is called the standard deviation. The standard deviation is the 
measure of dispersion that is most commonly used in statistical 
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investigations, and supersedes for all practical purposes the r.m.s. 
deviation, which depends on an arbitrary origin. 

The recognized abbreviation for the standard deviation is s.d. 
and its value is generally represented by a. 

Example 2. 

Calculate the standard deviation of the distribution in Example i . 
The method of calculation is as follows : 


Age 

n 

Frequency 

/ 

Deviation from 
mean 

S 

ys 

, 

/ 5 = , 

27-5 

I 

-6 

- () 

36 

32-5 

2 

-5 

— 10 

50 

37-5 

4 

-4 

- 16 

64 

42-5 

10 

-3 

- 30 

90 

47 -S 

21 

— 2 

- 42 

84 

52-5 

53 

— I 

- 53 

53 

57 -S 

126 

0 

-157 


62-5 

163 

-h 1 

+ 163 

1 163 

67-5 

35 

+ 2 

' — 70 

1 140 

72'5 

6 

-3 

-1- 18 

54 

77 -S 

1 

+ 4 

- 4 

16 


422 


+ 253 

750 




S/rV = 75 o and S/,. = 422. 

Therefore = 750/422 == 1-777, 

d = yX ' i^fr = (. 255 -i 57 ) 422 =-Z 32 - 

= 1-777 - -054 = I -723 
and o- = I ■ 3 1 class-interv'als 

= 6-56 years. 

5 . The standard deviation gives not onlv, as already explained, a 
general idea of tlie dispersion or spread of the observations, but 
also a definite numerical limit to the frequency of extreme cases, 
i.e. those in which the variate is much larger or much smaller than 
the mean. 
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We shall prove the following proposition : 

In a set of measurements, x^, ... whose mean is x, let the 

s.d. be a, so that 

iVcr* = sum of squared deviations = S [x^ — xy. 

Then the proportion of cases in which x — x (the deviation from 
the mean) is numerically greater than Act cannot exceed i/A^: i.e. 
there cannot be more than Nf)? cases in which x is either greater 
than 5 + Act or less than x—\a. 

Suppose that the actual proportion is p, so that there are pN 
such cases. In each of such cases the squared deviation {x — xY is 
at least (Act)*, so that the total squared deviations, arising from these 
cases, is at least pNX^d-. But this partial total cannot exceed the 
full total, AV*, derived from all the observations. 

Therefore pTVAV > Act* 

or /)>i/A*. 

Thus, if CT is small the observations will be heaped up round the 
mean, and their “precision” is great; while if ct is large the ob- 
servations will be widely spread, and their precision is small. 

It may be added that in the common case of a frequency dis- 
tribution (such as that of Example 2) which starts with small values, 
rises to much larger ones, and then falls to small ones, the actual 
value of p is usually much under the limit i /A*. 

This proposition is a simple case of Tchebycheff’s theorem. 
[Cf. W. F. Sheppard, J.I.A. vol. Liii, p. 82.] 

6. It is a simple matter to prove that the weighted algebraic sum 
of the deviations of a variable quantity from the mean is zero. 

For, if a be the origin,/, the frequency of an observation A',, 

then M = so that = MS/, . 

Also, if 5 , is the deviation of AT, from a, 

S/,Ar, = S/, (a + (J,), since (J, = AT, - a, 

= uS/, + S/, 5 „ 
i.e. = 

and M-a = Xf,S,IZf,. 

If therefore the mean be taken as the origin, M=a and S/,^, = 0. 
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7. We have shown that the r.m.s. deviation is a minimum when 
measured from the mean. It is instructive to show that the mean 
deviation is a minimum when measured from the median. 

Let the frequency distribution be /i,/2,/3,/4, ••■/r. ■■■fn with 
class-interval a, and let ^f^ = n. We may illustrate the distribution 
graphically, thus: 

.Y 

L ^ i ^ .L _ ^ I t I ^ L t I 

/* ft J* Jm /m+i Jn 


Let dy be the mean deviation from the temporary origin X 
distant c {< a) from the beginning of the class range m to m -f i . 

Then dj = ^ {/„c +/„_i {a -P c) (2a -h c) -f . . . -i-/i {m-i a + c) 


+f,nri - c) +/m+2 (2« -c)-t ... +f„ (n-7na + c)}. 

If now the origin is changed to m the mean deviation is d^, where 

« 

^2=^{/m0+/m-l^*+/m-22«+ ••• +A {m-ld) 

+fm4.ia+fm^22a+ ... +/„ (n-ma)}. 

“(/m+l^+/m+2C+ ••• +/;i0} 



cZf,- 

1 


m f 1 



= ~( 2 M-n), 
n 


m 

where AI='Lff. 

1 

^2 = Jj-f- (n — 2 A/), 

which <dj if M>hn, i.e. if the median lies to the left of /„,. 

Therefore, if n is even, the mean deviation is constant for all 
positions of the origin between the ^nth and (i?j-(-i)th observa- 
tions, and this value is the least value. If n is odd, d is least when 
the origin coincides with the J (n-1- i)th observation. That is, the 
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mean deviation is least when measured from the median, or if the 
median is indeterminate, from an origin in the class range in 
which the median lies. 

8. Semi'inter-quartile range. 

We have seen that, in an array of variates, the observation which 
is such that as many observations lie on one side of it as on the 
other is a particular form of average, i.e., the median, which can be 
used to obtain a measure of dispersion. There are certain observa- 
tions other than the median which divide the array and which may 
also be used to obtain such a measure. The observation which is 
such that one-quarter of all the observations lie below it in magni- 
tude is called the lonjcer quariile, w'hile that observation which is 
such that one-quarter of all the observations lie above it is called 
the upper quartile. In the example on p. 255 the lower quartile is 
the height of the (hypothetical) man between the tenth and eleventh 
man, and is therefore 5 ft. 6 in., while the upper quartile is the 
height of the (hypothetical) man between the thirtieth and thirty- 
first man and is therefore 5 ft. 8 in. 

The quartiles, like the median, can be estimated by first difference 
interpolation from frequency distributions, or from a consideration 
of the ogive curve, and the student will find it a useful exercise to 
obtain the quartiles from the examples given in this and the 
preceding chapter. 

From the definitions given above we see that if and ^3 are the 
lower and upper quartiles respectively, one-half the total frequency 
lies between and ^3 . This range, namely Q3—Q1, is called the 
inter-quartile range, and it is an even chance therefore that an 
observation taken at random will lie within this range. 

A simple measure of dispersion, and one which can be calculated 
with little difficulty, is the quartile deviation, Q, such that 

It is worthy of note that where the distribution is symmetrical, 
or nearly so, it is generally found that Q is approximately equal 
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The quartile deviation is generally called the semi-inter-quartile 
range and is an approximate method of estimating the dispersion 
of a frequency distribution. 

9. Relative measures of ilispersion. 

The measures of dispersion described in the previous paragraphs 
are all absolute and relate to the individual distributions only. In 
order to compare the dispersion between two different distributions 
\\’e need a measure of relative dispersion. Thus, the mean of a 
sample of sums assured may be ^317 and the standard deviation 
£10, while the mean of another sample may be $620 and the 
standard deviation $30. It does not follow, however, that the 
variability in the second case is greater than that in the first. We 
cannot compare the variability of the two distributions without 
eliminating the absolute values of the measures of dispersion. To 
obtain a relative value we express the particular measure of dis- 
persion as a ratio of the average on which the dispersion is based. 
Thus, to compare two or more standard deviations we calculate the 
standard deviation as a percentage of the mean, and obtain thereby 
the coefficient of variation. If, therefore, a is the standard deviation 
and M the mean, the coefficient is iooa,d/ for each distribution. 

The following is a simple example involving the use of the co- 
efficient of variation. 

Example 3 . 

In order to pass a certain examination candidates must obtain at least 
40 marks. All successful candidates with less than 50 marks are placed 
in the third class. The distributions of those who passed in the third 
class in two successive vears being given in the table below, find which 
was the more consistent set: 


Marks 

40 

41 

42 

4-1 

44 

4.S 

46 

47 

'48 

49 

No. of successful 
candidates in the 
first year 

1 


5 

10 

4 

1 

4 

5 




No. of successful 
candidates in the 
second year 




7 



3 

2 

6 

6 

2 
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The more consistent set may be regarded as the one for which the 
coefRcient of variation is the smaller. (See, however, Note below.) 


No. of 

First year 

Second year 

marks 

/ 

8 

ys 

y8» 

/ 

8 


/S> 

40 

1 

-3 

- 3 

9 


— 

— 

— 

41 

— 

-2 

0 

0 


— 



— 

42 

5 

— I 

- 5 

5 

— 

— 

— 

— 1 

43 

10 

0 

- 8 

0 

7 

-4 

-28 

II 2 ] 

44 

4 

+ * 

+ 4 

4 


-3 

0 

0 

45 

I 

+ 2 

+ 2 

4 


— 2 

- 6 

12 ' 

46 

4 

+ 3 

+ 12 

36 


— I 

“ 2 

2 i 

47 

5 

+ 4 

+ 20 

80 


0 

-36 

0 1 

48 

— 

~ 

— 

— 


4 1 

+ 6 

^ i 

49 

— 

— 

— 

— 


4 - 2 

+ 4 

^ 1 

— 

— 


+ 38 

— 


— 

-r 10 



30 

~_j 

30 

00 

26 

— 

-26 

L 

140 1 


First year : M = 43 + 30/30 = 44 . 

o=V 138/30-1 =v' 108/30 =V 3-6 = 1.897. 

Coefficient of variation = 1 89-7/44 = 4 ’ 36 . 

Second year : A/ = 47 — 26/26 = 46, 

140/26- I =v' 114/26 =v' 4-3846 = 2-094. 

Coefficient of variation = 209-4/46 = 4-47. 

The second coefficient of variation is larger than the first, so that the 
more consistent set was that in the first year. 

Note. Care should be taken in using the coefficient of variation as 
a measure of consistency. It might not be safe to use it if, for example, 
given two sets of frequencies, and /r = ^r constant, for all 
values of r. 

10 . Corresponding relative measures for other absolute measures 
may be formed by dividing by the corresponding averages. The 
mean deviation — Af,)/n gives a relative measure of 

S — while the coefficient of variation for the quartile 

deviation ntay be taken as ^ {Q9—Qi)l{Qs + Qi)- 
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*11. Standard error and probable error. 

Before the characteristics of different distributions can be com- 
pared by means of calculated statistical measures, certain considera- 
tions must be borne in mind. One distribution may be a random 
sample drawn from a large parent population, and it is to be 
expected that another sample of the same size would produce 
results conforming to those of the first sample and of the parent 
population, were they available. The results may, however, differ. 
In that case we must find the degree to which they are reliable, or 
alternatively, to what extent the observed differences arise simply 
from errors of sampling and errors of chance. Consider, for 
simplicity, the value of the mean calculated from a sample. This 
value is one of a distribution of means calculated from all p ssible 
samples of the same size, which lie on a frequency curve of w'hich 
the s.d. is found to be cr'/y';;, where a' is the s.d. of all the ob.ser- 
vations in the parent population and n is the number of obserx’ations 
in the sample. In practice a' is not know’n, and as an approxima- 
tion we use cr, the s.d. derived from the sample. In this case, aj \'n 
is known as the standard error of the mean. 

For reasons connected whth the normal curve of error (see p. 290) 
the standard error is often multiplied by -6745 to produce the 
probable error. 

We may explain the use of the probable error in the following 
manner. 

If two values of a variable x (say a and b) are given and it is 
required to find the chance that a value of the variable selected at 
random lies between these limits W'e may construct a distribution 
curve of unit area, and this curve will be such that the area between 
the ordinates a and b represents the chance required. Distribution 
curves may be constructed either from data given for a question in 
mathematical probability or from statistical data obtained from the 
results of a series of observations. We may take, for example, the 
binomial curve. A simple conception of this curve results from 
calculating by the mathematical definitions the number of heads 
that turn up when m coins are tossed. The relative frequencies are 


r M A S 


19 
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given by the successive terms of the binomial series and 

by plotting the relative frequencies against the number of heads we 
obtain a bell-shaped curve symmetrical about the mean. When m 
is very large it can be shown that this curve approximates to a 
curve known as the normal curve of error. 


M 



In Fig. 12, which is in the form of the normal curve of error, 
NM is the ordinate through the mean of the means. PQ and 
P'Q' are the ordinates through points at distances from N such 
that PN and P'N are equal to the probable error of the means. 
Then it may be shown that the area of the curve cut off by PQ, 
P'Q' and PP' is half the total area. Ordinates drawn through 
distances from N equal to twice the probable error would include 
most of the curve, while ordinates through three times this distance 
would include practically the whole curve. 

As a result of this property of the normal curve it is usual in 
practice to say that it is improbable that a statistical measure 
calculated from a sample will differ from the value which would 
be obtained from the parent population by more than three times 
the probable error. Thus, if the difference between two calculated 
means exceeds three times the probable error of the difference, it 
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is assumed that this difference is unlikely to have arisen from errors 
of sampling, but is due to intrinsic differences in the two samples. 
The difference is then said to be significant. 

12. We have already seen that the calculation of mathematical 
expectation can be reduced to tbe determination of the weighted 
mean. Also, if we have to find the value which occurs with the 
greatest frequency from a consideration of ideal objects— the most 
probable value — we have a parallel in the mode of a statistical dis- 
tribution. Further, deviations from the mean in a statistical ques- 
tion correspond to deviations from the expectation in a problem in 
probability. 

This parallelism may be extended. Equally likely events have 
their counterpart in equally likely intervals or ranges. Just as 
equally likely events are rare in practice, so ranges are seldom 
equally likely. For example, a simple question in mathematical 
probability is: “Two clerks are in an office: one goes to lunch 
between 12 and i and the other between i and 2. Each takes an 
hour for lunch. Find the chance that they are not out at the same 
time.” Unless it be assumed that all times between the given limits 
are equally likely, the problem cannot be solved as it stands. But 
it can hardly be doubted that, in practice, all such times would not 
be equally likely — for a variety of reasons. Similarly, in statistical 
problems there will be certain intervals or ranges which are much 
more likely to occur than others. Consider, for instance, the case 
of a lamp filament. The resistance of such a filament cannot be 
forecast with certainty, and if the resistance is to be, say, 300 ohms, 
it is more probable that the resistance of a filament made for this 
purpose will lie between 300 and 310 ohms than between 390 and 
400 ohms. Again, the average height of the men in a given com- 
munity is 5 ft. 7 in. A man chosen at random from the community 
is more likely to be between 5 ft. 6 in. and 5 ft. 7 in. than between 
6 ft. I in. and 6 ft. 2 in. Many such examples occur in statistical 
problems. 
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EXAMPLES 12 

I. The following table shows the proportion of male births per 1000 
births in a certain community. Find the mean and standard deviation 
of the distribution; 


Number of male births 
per I coo births 

Number of cases 


2 

477 - 

6 

4S9- 

43 

501- 

399 

51.1- 

176 

5 ^ 5 - 

9 


2 


2. Calculate the mean age, the median age and standard deviation in 
terms of year of age from tl)e following data : 


^ Agc-gioup 

Number of cases 

5 - 

20 

10- 

300 

15- 

860 

20- 

435 

25- 

230 

30- 

LSo 

35- 

70 

40- 

60 

45- 

35 

50- 

20 

55- 

10 

60- 

7 

65- 

3 


3. Find the values of the upper and lower quartiles. 


Value j -5 '1-5 2-5 3-5 

4-5 

! 

6-5 j 7-5 

8-S 

9-5 

10-5 1 11-5 

Frequency ^ 

22 i 34 j 52 j 76 

! 12 

1 

166 ! 

1 

240 1 326 


362 

186 1 32 


4. Estimate roughly the mean wage from the following data; 
Median 37s., Quartiles 29-55. and 40-55. 6 per cent, of the observations 
are less than 205. per week and 3 per cent, are 455. per week or more. 
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5. Illustrate by a diagram the following distribution : 


Length in in. 

o~ 

I- 

2 - 1 3 - 

r 

4- 

T 

5- 1 

6- 

7 - 8 - 9 - 

10- 

II- 1 

No. of cases 

I 

1-5 

2-5 4 

5-5 

8-5 i 

12 

16 1 20 18 

1 

9-5 

1-5 ' 


Estimate the mode, median and upper and lower quartiles. Calculate 
the mean, and check approximately the relative values of the mean, 
mode and median. 

6. If m a series of measurements we obtain m-^ values of magnitude x-i , 
m2 of magnitude X2, and so on, and if x is the mean value of all the 
measurements, prove that the standard deviation is 

V Em, 

where x = k+S. 

Taking k as 200 apply this formula to obtain the standard deviation 
of the exports given m the adjoining table ; 


Vear 

r xports m 
^ (millions) 

Year 

Exports in 
£ (millions) 

Year 

Exports in 
£ (millions) 

1904 

200 

1912 

203 

1919 

207 

1905 

198 

19:3 

205 

1920 

208 

1906 

197 

1914 

202 

1921 

200 

1907 

201 

1915 

199 

1922 

195 

1908 

202 

1916 

195 

>923 

193 

1909 

200 

1917 

197 

1924 

200 

1910 

195 

191S 

203 

1925 

208 

I9I I 

199 






*7. Given the following statistics, find the probable error of the mean: 


Height of man 

No. of 
cases 

Fieight of man 

No. of 
cases 

5 it. 3 ill. 

2 

5 It. 9 m. 

16 

.S 4 

2 

5 10 

16 

5 5 

2 

5 II 

14 

5 6 

5 

6 0 

7 

5 7 

12 

t> I 

2 

5 8 

*9 

(> 2 

•> 


What use is made of the probable error? It you measured another 
body of men and found that the mean height was 5 ft. 7 in., with nearly 
the same probable error, what conclusion would you draw? 
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•8. Find the probable error of the mean : 


Age-group 

Number of cases 

IS- 

5 

20- 

83 

25- 

157 

30- 

162 

35 - 

147 

40- 

125 

45 - 

105 

50- 

80 

55- 


60- 

40 

65 - 

22 

70- 

14 

75 - 

4 

80- 

1 


9. Find the average wage, the median and the standard deviation of 
the following distribution : 


Weekly wage 

32r. (id. 

y^s. (id. 

42s. 6(/. 

47s. bd. 


4 

3 

6 

10 

Weekly wage 

52s. (id. 

57 J. bd. 

62i. bd. 

byi. bd. 

Number of cases 

12 

16 

13 

I I 

Weekly wage 

725, hd. 1 77s. iid, 1 825. bd. ^ 87^. bd. 

Number of cases 

8 

8 1 4 1 3 

Weekly wage 92^. (id. 

gys. bd. ' lozi. bd. | 

Number of cases j i j 0 , i 

1 


10. The scores of two golfers for 24 rounds each are 

A 74. 75. 78. 78. 72. 77. 79. 78, 81, 76, 72, 72, 77, 74, 70, 78, 
79, 80, 81, 74, 80, 75, 71, 73. 

5 86, 84, 80, 88, 89, 85, 86, 82, 82, 79, 86, 80, 82, 76, 86, 89, 
87, 83, 80, 88, 86, 81, 84, 87. 

Which may be regarded as the more consistent player? 
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II. A sample of the numbers of persons engaged in a certain occupa- 
tion is as follows ; 


Age last birthday 

Number 

14 and 15 

13 

16 and 17 

35 

18 and 19 

41 

20- 

108 

25- 

176 

35 - 

204 

45- 

190 

55- 

65 

60- 

39 

65- 

21 

70 and over 

12 


Find the standard deviation. 

12. What is meant by the term “Coefficient of Variation”? 

During the hrst 20 ivee-ks of a session the maris of two students 
taking the course were 

A' 5?i, 59, 60, 54, 65, 6(), 52, 75, 69, 52, 65, 66, 56, 42, 67, 30, 
SO, 60, 46, 48. 

Y 56, 87, 89, 78, 71, 73, 84, 65, 66, 46, 84, 56, 92, 65, 86, 78, 
44 ' 54 > 7 ^' 

Which would you consider was the more consistent? 



CHAPTER XIII 


MEAN VALUE. THE APPLICATION OF 
THE CALCULUS TO THE SOLUTION 
OF QUESTIONS IN PROBABILITY 


1. It has already been seen that the mean of a number of quantities 
is simply the arithmetic average, weighted if need be. Thus, if there 
are n quantities <f> (a), <l>(a + Aj), (j>(a + h^), ...<f>{a + j), then the 

mean value is 

^ (n) + ^(n-{"Aj) + (^(n + A,) + (n + A,|_.|) 
n 

Suppose that y = (f>{x) is a function of x, and that x has the 
n successive values a, n + A, a + 2 A, . . . n + (n — i ) A. Then the mean 
value of <f> (a;) for these « values from x = a to a: = « + (n - i) A is, 
as above, 

(u) + (n 4" A) + ^ (n 4" 2 A) 4-...4-(^(n + w — lA) 
n 


Let b = a + nh &o that nh — b — a. 


r-n-l 

Then the mean value =--7 <l>(a + rh) 

nn r-fl 


b — a 


*n — 1 


D (a + rh). 
r= 0 


If X varies continuously between a and b so that the number of 
values, n, tends to infinity, the mean value becomes 


h[^{a)+<f){a + h) + ... +(ji (a + n- ih)] 

Lt r 

n— 00 ^ ^ 

It should be noted that, where the function is continuous, the mean 
value depends on the law governing the selected values. l''or example, 
the mean value of the ordinate of a semicircle determined by ordinates 
passing through equidistant points along the diameter is different from 
the mean value determined by taking equidistant points along the 
circumference. It thus appears that the mean value of a continuous 
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function <f) (x) is not a definite quantity but a quantity varying according 
to the law assumed for the successive values of x. 

This point is illustrated in Example 3 . 

2 . This application of the integral calculus enables us to solve 
many problems involving mean values. The solution of these 
problems can generally be effected by the use of single integrals, 
although some of the more difficult questions necessitate the use 
of double integration. The three following examples illustrate the 
use of single integrals and, as will be seen, the solutions present 
little difficulty. 

Example 1. 

Find (i) the mean value of the ordinate, (ii) the mean value of the 
square of the ordinate of the curve y = asinnx for the range * = o 
n 

to X = - . 
n 

(i) We have to find the sum of the ordinates over the given range 
divided by the number of ordinates. Since the number of these ordinates 
will tend to infinity as the distance between them tends to zero, we 
shall have -n n 

\ r ^ 

vdx asinnxdx 


a 

— cos nx 

ti 


7T 


[■ ' 71 


X 


1. J 0 


a 

a 

— cos IT + 

COSO 

« 

n 

77 


n 



a 

-- (- 1 ) V 


n 

n 

— . sine 


, since cos rt = 


77 
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(ii) Similarly 


PROBABILITY 

ir 

[ a^sin^nxdx 
. Jo 

r* 

Jo 

ir 

I a^\{i—co&2nx)dx 

Jo 


ia^x—ia^ *-sin2nA;| 

- - J 

n 

n 

= h=ha\ 

211 j n * 


Example 2. 

A straight line of length a is divided at random into two parts. Find 
the mean value of the rectangle contained by the two parts. 

Let X be the length of one part; then ta - a: is the length of the other 
part. The rectangle contained by the two parts is x[a — x), and we 
have to integrate this function with respect to x. Also, since x may 
have any value from o to o, these values will be the limits of integration. 

Hence, 


■ a 

X {a — x) dx 
. 0 


[ {ax — x^) I 

Jo_ 

(“rfx 

Jo 

Vax^ 


2 3 Jo 


0 
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2 ~ 3 
a 


6 ' 

Note. Since we are required to find the mean value of an area, the • 
result must be of the second degree in a. 

Example 8. 

A ship steering a straight course at a uniform speed picked up a 
stationary object 2 miles away with the beam of a searchlight, the beam 
making an angle of 45° ahead with the course of the ship. The beam 
was kept on the object until it made an angle of 60° astern with the 
course. Calculate the mean distance of the object during this time. 


o 



Let O be the object and AB the course of the ship. Draw OD 
perpendicular to All. 

'I'hen the lengths of the various hues in the figure are OA = 2, 
AD- \/2, OD= Vz, DB= and 0 /i = 2 v 2 /v' 3 - 

Let r be any position of the ship, distant .v from D, so that OF^ = 2 + 
Then S, the sum of all lengths such as (JP, 

I ^ 'A V a _ __ 

- Vz+x'dx 

J-V2 

~ i + 2 + 2 log f.V + Vx- + 2)] ' * 

— V 2 

which, on evaluation, 

= 1+ v'2 + log(v6+v'3). 
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, fV2/v'S 

/. M.V. = 5y'J dx 

s+ V2 + 1or(-v/6+ V3) 

It -will be noted that the variable has been taken as the distance from 
the point D. If either of the angles POD or OPD had been selected as 
the variable an erroneous result would have been obtained. This can 
be seen by a consideration of the argument below. 


o 



The use of the angle at the vertex O as the variable implies that this 
angle changes uniformly during the progress of the ship. If the ship 
were sailing along the arc WXYZV of the circle round O, equal 
variations of the angle at O would correspond to equal distances along 
the arc, i.e. to a uniform speed of the ship. 

The ship, however, is sailing along AB. 

If the ship were to sail along AB so that the angles at O altered at 
a uniform rate, the ship would have to travel from A to E in the same 
time as from E to F, F to G, and G to B. In that event the ship would 
not be travelling at a uniform speed along AB. 

If the mean value is looked upon as the result of drawing lines from 
O to the base AB at equal intervals and taking the total length of all 
the lines divided by the number of lines when this number is in- 
definitely increased, it is obvious that the result when the lines arc 
drawn so' as to divide the base AB into equal parts will not be the same 
as when they are drawn so as to divide the angle AOB into equal parts. 

Similarly when the angle is taken at the base. 
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3. The use of double integrals. 

Consider first a function, ^ (x), involving a single variable x. 
Then the mean value of 0 (x) between the limits x = a and x = b is 

x=‘h 

S ^(x)/ V 

x^a ■ s a 


If there be a function ip {x, y) of two variables a and y, we may 
write similarly 

x^b y-p x = b v-j9 

M.v.= i: i:./.(.r,r)/ s I, 

y—OL < x=o j/=a 

where x and y proceed by small but finite interv als. 

If these intervals, say i n and i/w for x and y respectively, be 
very small, the numerator and denominator of this fraction will 
be very large. 

Multiply both numerator and denominator by (i/n. ijm), so that 
the fraction is 


T-h j j j-by-f , . 

S i: ./<(.v,y) -/ ^ ^1--. 

x^a y -a tZ 1U x— a v- x t/Z 

Replace the small quantities ijn and 1 m by l^x and Ay, and 
find the limit when \x and Ay each tend to zero. I'hen, since the 
limit of a quotient is the quotient of the limits of the numerator 
and denominator (provided that the limit of the denominator is 
not zero), we have 

x=b J/ — 5 

A A ./-(.V, v) AyA.v 
M.\.-Lt - ^ ^ - , 

A A 1 AyAv 

j*=a y- X 

whczi A.r, Ajy each tend to zero. 




i/f (x, y) dydx 




dydx 


The following examples are illustrative of the method of appliat- 
tion of double integrals. 
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Example 4. 

A rod of length a is divided at random into three parts. Find the 
mean value of the sum of the squares on the three parts. 

Let OP be the rod of length a. Take any point X in the rod distant 
* from O, and another point Y distant 

y from X. The squares on the three q ", , p 

segments of the line will be y*, X ^ Y 

{a-x-yY, respectively. " ^ 

Let X be fixed. Then y will vary **' 

from o to a — x, so that the total values of the sum of the squares 
OX^, XY*, YP^ will be 

Z [x“+y* + (a-x-y)*], 

where y has every value from o to a — at. 

Now let X vary. The limits of variation of x are evidently from o 
to a. Then 


M.v. : 


a ra—x 

0 Jo 


[at* +y* + (a - x dydx 


Jo Jo 


dydx 


ra ra-x 

(2x^ + 2y''‘-t-a''^-zax-2ay + zxy)dydx 

Jo Jo 


f” 

.lo 


ra -a-x 

dydx 

Jo Jo 

zx^y + + a*y — zaxy — ay* + ary* 

a r ’] a 

J 0 I Jo 


dx 


dx 


( (a — x) + f (fl — (a — jc) - zax {a — x) 

JO 

- G (a - + X (fl - x)^] dx 

“ ' * 
{a~x)dx 

J 0 

which becomes, on evaluating the integral, 

a* 


2 
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It is important to note that if the sums of the required values can be 
obtained by considering separate sets of the values, the total sum must 
be found and divided by the total of the values. For example, if 
we could best solve a mean value problem by summing {x) for ail 
values of x varying continuously from o to a, and ift (x) for all values 
from a to b, then we must write 


and not 


[ <l> (x)dx+ \ Ip (x) dx 

__ J_0 J a 

~ 1 “ 1 

dx+ \ dx 
Jo Jo 

rp (x) dx i Ip (x) dx 

" + 12 

ra rb 

\ dx \ dx 

Jo Jo 


'I'hc fallacy in the second expression is easily seen when we consider that 

A B . . A + B 

+ ^ IS not necessarily the same as 


Example S. 

Find the mean value of the distance from one comer of a square to 
any point in the square. 

Let OABC be the square. Take any point X in the side OC distant x 
from O, and draw XM parallel to the side 
CB to meet the diagonal OB in XI. Let ^ 

Y be any point in XM distant y from X. Y / 

The length OF = V*' +y" and ^ ~7 Xbl 

XM=^OX = x. / X 

For a fixed value of x the sum of all values / / 

of OF, i.e. of + will be // 


Vx^+y-, 


Fig- i6. 


since F may take up all possible positions 

on the straight line XM. But x may have all values from o to a. 

Therefore the sum of the distances from the comer O to any point 
in the triangle OBC 


= f ( Vx^+v'-dydx. 

J 0 JO 
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For the sum of the distances from O to any point in the square OABC 
we must double this. The required mean value is therefore 

2 f I Vx^+y^dvdx 
Jo .'0 ' 

2 I “ r dvdx 

.0 Jo 

= - a"[\/2 + log(i + t^2)](/*, 

<2 Jo 

on integrating with respect to _>• and inserting the limits o and x, 

= ^ [ V2 + log ( I + v'2)]. 

4. Application of the calculus to probability. 

When we are dealing with problems in probability where the 
number of cases involved depends upon magnitudes varying 
continuously over a given range, the method of approach is similar 
to that outlined above for the solution of mean value problems. 
The general principle is to take the quotient of the number of 
favourable ways by the number of possible ways, where all ways 
are equally likely. 

The application of the integral calculus to problems in probability 
is best illustrated by examples; as a general rule it is sufficient to 
employ single integrals, although in some instances it is of advantage 
to use double integration. Many problems can be solved by either 
method, and examples of both methods are given below. 

Example 6. 

A line of given length is divided into three parts by two points taken 
at random. Find the chance that no one part is greater than the sum of 
the other two. 

We shall adopt the method of single integration for the solution of 
this question. 

Let one of the random positions 

be at jP distant x from the end A of yj 1— 1 1 — g 

the line, and let v4C' = CB= la. <- — x--- > ^ ^ 

(i) Consider the favourable cases 
in which AP ( x) is less than Ja. 

Take a point Q in the line such that PQ = \a. Then, for the conditions 
of the problem to be satisfied, the other random point R must lie in 
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CQ (otherwise PR or RB will be greater than half the line and conse- 
quently greater than the sum of the other two parts AP, RB or AP, PR). 

Now P lies in the small part dx between distances x + dx and x 
from A, and since P has been taken at random, the chance that it falls 

in this small part is ^ . 

a 

The chance that R lies in CQ is 

CQ ^ AQ~AC _x+PQ-AC _ x 
a a a a' 

since PQ and AC are each ^a. 

Therefore the total chance that P falls in dx and R in CQ is 

f * dx 
Jo a' a' 

the limits of x being o and ia. 

(ii) Consider the favourable cases in ■d ^ B 

which AP{-x) is greater than \a. < x > 

Then, as altove, R must lie in QC, 

the chance of which is ' ' 

QC _ PQ - CP _\a — x + \a _a-x 
a a a a ‘ 


Therefore, since the limits of x are now and a, the chance that P 
falls in dx and R in CQ is 

r ” a — .V dx 

Jjo a ' a' 

The total chance required 


fb'o: dx f“ a — * dx 
a' a a ' a 





Example 7. 

Two points are selected at random on a line of length a. What is the 
probability that none of the three sections into which the line is thus 
divided is less than 


r M A s 


20 
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As an alternative this question will 
integrals. 

Let AB be the given line divided 
into four equal parts at C, D, E. Now 
if P and Q be the random points, then 
to satisfy the required conditions 
neither can he in AC or EB. 

Let P be at distance x .from A. Then the limits of * will evidently 
be Ja and ^a. Let Q be at distance y from A ; then Q can take up any 
position from \a along the line from P to the point E. As the origin is 
at A, the limits of y will be ^a+x and fa. 

Again, when P is fixed, all possible positions of Q will be from P 
to B, i.e. y can vary from x to a. Obviously x can vary from o to a. 

Then by the unitary definition, the required chance is 


be solved by the use of double 


P 

-t-f- 


c ■ /> 

Fig. IQ. 


Da ill’ 


} a 

J_± 

ra ra 


dx 


JoH/* 

a 

f [|a- Ja-x]d* 

^ a 


.0 


(a — x) dx 


fax - \ax - ix^ 


ax—ix‘ 


on inserting the limits and simplifying. 


Note. Each of the above two problems can be solved by the different 
methods here demonstrated. The two questions are exactly similar, and 
with the necessary alterations in the limits precisely the same working 
can be applied. 


Example 8. 

Two independent events, A and B, must each happen once and once 
only in the future. The chances of their happening in the interval from 
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t to t-Vdt are proportionate to a*dt and h^dt respectively, where t is 
the time elapsed and a and b are constants (positive fractions). Find 
the chance that the two events happen in the order AB. 

Let the chance that A happens at the moment of time dt be ka^dt 
and the chance that D happens be lb‘dt. Then, since the events must 
happen, we have 


■<» roo 

ka'dt = i and 
. 0 Jo 


lb‘dt=i. 


This gives k — — log a and / = — log b. 

Now the chance that B happens between now and time i from now 

= Ih^dx. 

J 0 

Therefore the chance that B has not happened by that time 


=-j; 


Ib^dx. 


The chance that A happens at the moment of time dt 

= ka*dt, 

and the chance that A happens at that moment, B not having happened, 
= ( I — j llf^dx ) ka'dt. 

' Jo 

Therefore total chance that the events happ)en in the order AB 

ri 


J 0 


which becomes 


I — ib^dx ka'dt, 

Jo 

log a 


log a + log b 

on substituting for k and / and evaluating the integrals. 


Example 9. 

In a certain year A and B were in London for one period only in 
each case, A for one-third of a year, B for one-quarter of a year. As- 
suming that in the case of A any one period of one-third of a year 
and in the case of B any one period of one-quarter of a year is as likely 
as any other period, find the probabilities that 

(1) A was in London the whole of the time that B was; 

(2) A and B were not in London at any moment together ; 

(3) A and B were in London at some moment together; 

(4) A came to London before B. 


30-3 



3o8 probability 

(i) The chance that A arrived in London at point of time between 

t and t + dt from the beginning of the year is y, since he must have 

arrived in the first two-thirds of the year. 

The chance that B arrived in the one month permissible = ^ . 

I 9 

Therefore the chance that A was in London the whole of the time 


, „ 1 dt I 

that B was = - . 

- 1 0 9 S 9 

(2) The chance that A and B were not in London together = (a) the 
chance that B arrived after A had left-f(i!>) the chance that B left 
before A arri\'ed. 

For (a) B must have arrived between times t + J and i.e. in the 
space of time ^ — The limits of i are o and 

5 

I* 1 2 _ / ^4 2 c 

Therefore chance = t = 

Jo I i 144 

For (h) B must have arrived between the tijnes o and f — i, and the 
chance of this event, namely that B left before A arrived 


^ t-\dt 
1 f ?~'44' 


Total chance under this head = f 

(3) This is evidently the complement of (2) and is 


I— 4:1 or 


7 " 2 - 


(4) If A came to London before B the chance is evidently 


^ 1 f 9’ 


5. Geometrical eoludons. 

Many of the above types of question can be solved by the aid 
of geometry. Since a definite integral represents an area, the ratio 
of the number of favourable ways to the total number of ways 
when there is continuous variation between the limits can evidently 
also be solved by relating the areas enclosed by parts of curves. 

These curves or parts of curves may take the form of rectilinear 
figures, and in this event the problem can often be solved in a 
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simpler manner by the use of geometry than by having recourse 
to the methods of the calculus. 

Consider for instance Example 7 (p. 305) : 

If the straight line be divided at random into three parts 
X, y, a — x—y, the following must hold to satisfy the conditions 
of the problem: 

x+y<a-, x,y, a — x—y ciLch> ^a. 

We have 

y > |aj 

and a — x—y>{a, 

giving x+y<'^a. 



Draw the straight lines LM, .v+jy = a and NP, x + v = |fl and 
complete the diagram as shown: 

OM = a = OL\ OA = la\ OB=^\a\ AP^\a. 

Then the conditions above may be illustrated thus: 

.x:> means that tlic points must be to the right of AD\ 

y > la means that the points must be above BC ; 

.Y-fV< 2 ^ means that the points must be below the line AP. 
The only points satisfying all these conditions are contained in 
the shaded area. 
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Draw perpendicular to AD meeting DA^ and Al^D^ in h^, b.^ 
respectively, and CQ perpendicular to AD meeting A^D^ and A^D^ 



Fig. at. 


in , Cj respectively. Then the space of time over which X and Y can 
meet between the points B and C on their journey is represented by 
the area b^b^Mc^c^. 

• j , zrez b,h,Mc,Cf, 

The required chance= . 

area KLM.\ 

By simple geometrical methods the value of this is found to be . 


EXAMPLES 13 

1. There are m posts in a straight line at equal distances of a yard 
apart. A man starts from any one and walks to anv other; prove that 
the average distance which he will travel after doing this at random 
a great many times is J (m + i) yards. 

2. If two milestones be selected on a straight road n miles long, what 
is their average distance apart? 

3. Two quantities are taken at random from o to a; find, by means 
of the integral calculus, the chance that the greater of the two is less 
than a given value b. 

4. Find the mean value of the reciprocals of all quantities from n 
to 2n. 
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5. A ladder of length I can be safely used, without its being secured, 

at any angle to the ground between ^ and If any angle between 

4 3 

these limits is equally likely, find the mean vertical height reached by 
the ladder in an infinite number of placings. 

6. OP is a straight line of length p. A fixed point Q is taken in OP 
such that OQ = q. Two other points are taken at random in OP. Find 
the chance that they both fall in OQ. 

7. A point is taken at random on a given finite straight line of 
length a. Find the mean value of the sum of the squares on the two 
parts of the line. Find also the chance of the sum being less than this 
mean value. 

8. A semicircle, APB, stands on a base, AB, of length 2r. P is a 
point on the circumference, and AP, BP are joined to form the right- 
angled triangle, APB. Find the mean value of the area of the triangle: 

(1) if P is a point chosen on the circumference at random; 

(2) if P is fixed by choosing a point N at random in the base AB, 

and erecting a perpendicular from N, to meet the circum- 
ference at P. 

9. Find the mean value of the ordinate of a semicircle, the points 
along the diameter at which the ordinates are taken being equidistant. 

10. In two opposite sides of a square, whose side is of length a, 
points P and Q are taken at random and are joined by the line PQ 
which thus divides the square into two pieces. Find the mean value 
of the area of the smaller of the two pieces. 

1 1 . Find the mean of the square of the distance of a point within a 
given square of side za from the centre of the square. 

12. A straight line of length a is divided at random at two points. 
Find the mean value of the product of the three segments. 

13. A point is taken at random within the area bounded bv v = .v log x, 
the x-axis and the ordinates .v= i, .v = 4. Find the probability that the 
distance of the point from the y-axis is less than 2. 

14. Three points are taken at random on the circumference of a 
circle. Find the chance that the sum of any two of the arcs thus cut off 
is greater than the third. 

15. A straight line is divided into three parts by two points taken 
at random. Find the chance that none of the three parts is greater than 
five-eighths of the line. 
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1 6. There are two clerks in an office, each of whom goes out for an 
hour for lunch. One may start at any time between 12 and i o’clock, 
the other at any time between i and 2. All times within these limits 
are equally likely. Find the chance that they are not out together. 

17. Find the chance that the roots of the equation x^ — zax + b^o 
are real, where a and b are positive proper fractions chosen at random. 

18. The point M is the centre of a line LMN of length 4<2. Two 
points P, Q on the line are chosen at random. Find the chance that 
the sum of the two distances MP and MQ is greater than a. 

ig. If on a straight line of length {a + b) two lengths a and b are 
cut off at random, find the chance that the common part does not 
exceed a length c. 


20. In a line AB of length 3^, a point P is taken at random and then 
in AP a point Q is taken at random. What is the probability that 
PQ exceeds a? 

21. The sides of a rectangle are taken at random each less than 
an inch and all lengths are equally likely. Find the chance that the 
diagonal is less than an inch. 

22. Three points are taken at random on the circumference of a 
circle. Find the probability that they lie in the same semicircle. 


23. Two points are taken at random on a given straight line of 
length a. Prove that the probability of their distance exceeding a given 

\ c 

length r ( < fl) is equal to ' i ^ ■ • 


24. OA and OB are straight lines of length a at right angles to one 
another. P and Q are points taken at random in OA and OB respectively. 
Find the chance that the area of the triangle OPO is less than 

25. A starts to walk from A to T 3 miles apart at 3 miles per hour. 
On the journey he unknowingly drops his handkerchief, but discovers 
his loss when he has covered half the remaining distance. He then 
proceeds to retrace his steps at 4 miles per hour. B starts on the same 
journey at 3 miles per hour 5 minutes after the handkerchief is dropped. 
Find his chance of reaching the place where it was dropped before 
A docs. 

Assuming that B picks it up, find the mean distance he would have 
to carry it to restore it to A if the series of events were to take place a 
large number of times. 

26. Find the mean distance between two points on opposite sides of 
a square whose side is unity. 
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1. Find the sum of n terms of the series i, z, 4, 9, 19, 36, 62, .... 

2. A person writes four letters and four envelopes. If the letters are 
placed in the envelopes at random, what is the chance that not more 
than one letter is placed in its correct envelope? 

3. Find the mean age and median age of the following distribution: 


Age 

Population 

1,000’s 

Age 

Population 

1 ,000’s 

20 - 

382 

55 

148 

25 “ 

379 

6c 

115 

30- 

348 

65- 

84 

35 - 

3*6 

7 c 

52 

4Q- 

272 

75 “ 

26 

1 45 - 

234 

80- 

1 1 

! to- 

•94 

8 t- 

4 


4. Given Uq= 1027, 1^^=1212, «jj=i469, «jj, = 2oi4, explain 

(1) how you would complete the series from «o to 1/,; 

(ii) how you would proceed if you were asked to complete the 
series from to Wjj supposing that it were unnecessary to 
find Uj, ti2, U3, 1/4 and Mj. 

5. The faces of a cubical die are marked i, 2, 2, 4, 6, 6. Find the 
chance that, in ten throws, four 2’s, two 4’s, four 6’s are thrown. 

6. Find the tenth term of the serie.s : 

(а) I, 4, 13, 3(1, 97, 268, 765, ...; 

(б) 2, 12, 36, 98, 270, 768, .... 

7. Show that = + and hence prove by mathe- 

matical induction that 

= ti5,A''Ux + «(i) Af jA"-'Mx.4.i + »,2, AV,A’‘--’«x . 

+ n, 3 ,A^'x-^”"®“x+ 3 +---- 

8. Explain the mathematical meaning of the word “probability”. 
If the probability of the h.appening of an event at each trial be p, and 
It trials are to be made : 

(r) What does /)« represent? 

(2) What is the probability that the event will happen at least 

m times? 

(3) What is the most probable number of times the event will 

happen ? 
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9. Prove that in the process of obtaining divided differences of the 
function u^, given u^, u^, u„ the last divided difference is numerically 
the same whatever the order of the arguments and the corresponding w’s. 

10. Show that 

^(2) ^(3) 

(1) Xjc"' = C+ , Ao'"+ - t A*o"‘ + ...; 

2I 3! 

v(2l 

(2) = C + , A1/0+ , A% + .... 

2! 3! 

11. Two Companies A and B make simultaneous issues each of 
1000 bonds. Those of Company A are redeemable by equal drawings 
spread over 20 years, and those of R by equal drawings spread over 
40 years. Find, in the case of two definite bonds, one of each issue : 

(1) the probability that the bond of Company B is redeemed before 

the bond of Company A\ 

(2) the probability that the bond of Company B is redeemed before 

the bond of Company A and within 1 5 years of issue. 

12. If Up. “51 “is four values of a function at equidistant points, 
find expressions true to third differences for u« and Ug, solely in terms 
of Up, Ug, ttio and 

13. Explain what is meant by the terms median, histogram, ogive, 
illustrating your answers by rough diagrams. 

14. Two throws are made, the first with three dice and the second 
with two. What is the probability both that the first throw is not less 
than II and that the second throw is not less than 8? 


15 - 


Show that the series whose nth term is 

L3-5 (2”Z3J A 2 (n- 1 ) 

S"-! (w-i)! 


»x 


-n+J 


is equivalent to 

16. Obtain the approximate quadrature formula 
fit 

I ^m,<& = 2V(27“»+i7«]+.5«2-“s)- 


17. The numbers of members in a Friendly Society were available 
for the following years ; 


Year 1922 1923 1924 1925 1928 

Number of members 995 998 1003 996 976 



MISCELLANEOUS EXAMPLES 


317 

It was desired to obtain estimates for the years 1926 and 1927. This 
was effected on the assumption of a constant fourth difference. Sub- 
sequently it was discovered that the numbers for 1926 were actually 
1002, and a fresh estimate lor the year 1927 had to be prepared. Calcu- 
late the original estimates for 1926 and 1927, and find the revised figure 
for the year 1927. 


— he 

18. If b and c are positive quantities (b>c) and if may 

OJr — o — 3f 

have any value between b and c, all such values being equally likely, 
find the probability that x is real. 


19. Establish the formula for the difference of a product of two 
functions, 

and deduce the formula for the sum of a product, 

n — 1 Ti — 1 

JlAsO x = 0 


n - 1 

By means of this formula find the value of E x’^. 

j— 0 

a-^ X 

20. If E = for all integral values of a, prove that, to the third 
order of differences, 

u- = -aw, - -008 (a.-,o - 2W5 + u'g). 

Given the following table, find u,, and iq,: 


a 

1 0 

5 ; 

10 ' 

15 ; 

20 

a +4 

■0427 1 

1 

■1467 1 

•2459 

1 

•3408 i 

■43 « 7 

x^-a 

.1 L 

1 

i 

1 

1 



21. If events A, B, and C are independent of each other, and events 
E and F are mutually exclusive and are both contingent upon the hap- 
pening of A, give an expression for the probability that either E or f 
will happen and that neither B nor C will happen. 

22. The equation 

= 15.V 

has a root between i and 2. Obtain it to three places of decimals by 
inverse interpolation. 



3i8 mathematics for actuarial students 

23. The following formulae for approximate integration are correct 
to third differences : 


f+S 

J ^ ^ = f (3M_J + 2«„ + 3Mj), 

C+3 

J u^dx = u_ 


“-s + 4 “o + «s- 


Prove that if these formulae are applied to a function whose fifth 
differences are constant, the respective errors involved in the approxi- 
mations are in the ratio 7:18, and are in opposite directions. 

By a combination of the two formulae obtain an expression, correct 

C +3 

to fifth differences, for J u^dx. 


24. Define standard deviation. Find the mean and standard deviation 
of the following distribution ; 


Length in cms. 

No. of cases 

10- 

2 

» 5 - 

— 

20- 

8 

25- 

1 1 

30- 

12 

35 - 

16 

40- 

25 

45 - 

17 

5 ^ 

8 

55 

I 


25. If third differences of u are constant, express u„ in terms of 

«5, Auj, and 

26. If three different whole numbers from i to 30 inclusive are 
chosen at random, what is the probability that their sum is equal to 30? 

27. In each of two adjacent sides of a square a point is taken at 
random. Find the chance that the length of the line joining the two 
points is between \a and a, where a is the length of a side of the square. 

28. Draw the graph of y = *® — 3x-(- 1. By reference to the graph, 
supplemented by arithmetical trials, find approximately the value of the 
negative root of the equation x® — 3X 1 = o, correct to two places of 
decimals. 
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29. A and B throw in turn with two dice, A having the first throw. 
A is to win either (i) if he throws a double six in his first six throws 
or (2) if he throws a double six before B has a throw scoring 9 or more. 
Find an expression for .d’s chance of winning. 

30. Find the mean length of a straight line drawn from one of the 
angular points of an equilateral triangle to a point taken at random in 
any one of the sides. 

31. Find an expression, correct to fourth differences, for the value 
of when jc= 1, in terms of u__^, 1/ ,, u„, Uj, u.^. 

32. Show that, if fourth differences of u are constant, 

[5]- 1) K+ 2 -« 2 ) = -^Xu 

33. Prove that 

I'l Jx I / I , a 

2 ^ Z 1 9 

)n i+x^ 4<i \ 6a , | a^ + a:* 

approximately, where a is a positive integer. 

By putting fl = 3, obtain the value of rr to three places of decimals. 

34. Three numbers are selected at random, one at a time, from the 
five numbers i, 2, 3, 4 and 5, repetitions being allowed. Find the prob- 
ability that the third number selected is not less than the second and the 
second is not less than the first. 

35. By means of the formula (taking n= 10) 


)nn j — m ^ j 

^ ^ r ni ~ (*^a+mn ^o) 

J --0 x -0 2 


n'^ — i 

; dUx\ 




12 

( * L 

^dx\ 


approx., 

find the approximate value of 

I 5°- 
logio 

, given that 

*ogio-2 = -30 

logio 3 = -4771. logio <’ = ■4343- 





36. Given that 

«„ = 

16 




«l + «2 = 

64 



«.i + 

«4 + Ms = 

266 



tt, + a7 + Mg + M8 = 

1029 




find the values of u^ and Uj, on the assumption that is constant. 

37. Through two points taken at random in a diagonal of a square, 
two straight lines are drawn parallel to one of the sides and to each other. 
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Find the probability that the area of that part of the square between 
the two lines is not less than one-third of the area of the whole square. 

38. The number of herds of cows in a certain district is as follows ; 


Number of 

Number of 

cows in herd 

herds 

I-IO 

311 

11-30 

538 

31-40 

227 

41-50 

199 


Estimate the total number of cows in the 550 largest herds. 

39. A and B play a game in which tlity toss a coin alternately. If 
the coin turns up head the thrower scores two points; if it turns up 
tail his opponent scores one point. The winner is tlic one who first 
scores at least seven points. A has scored four points and B two, and it 
is A's turn to toss the coin. Find the chance that A will w'in the game. 

40. In a lottery there have been sold 2500 £1 tickets numbered 0001 
to 2500 and 5000 lor. tickets numbered 2501 to 7500, and it is arranged 
to draw £500 worth of tickets, the holders of which will receive 10 times 
the face value of their tickets. The following method of drawing the 
winning tickets has been suggested. Ten counters marked with the 
figures o to 9 respectively are to be placed in a bag. A counter is to be 
taken at random from the bag and replaced ; this is to be done four times 
and the figures obtained are to be written down in the order in which 
they are drawn to form a four-figure number. Drawings are to be 
carried out in this way, the tickets bearing the numbers so drawn to 
participate in the prize, until £500 worth of tickets have been drawn, 
subject to the provision that any drawing giving a number to which 
there is no corresponding ticket or a number which has already been 
drawn or a number which w'ould cause more than ^500 worth of tickets 
to be drawn is to be ignored. 

The following criticisms of the suggested method have been made ; 

(i) The holders of the ioj. tickets have an advantage over the 

holders of the £i tickets, since although the los. tickets cost 
only half as much as the £ i tickets, they have the same chance 
of being drawn. 

(ii) The system by which it is possible for a number to be drawn 

to which there is no corresjwnding undrawn ticket produces 
inequalities in the chances of the various ticket-holders. 
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(iii) The system of making the drawings is inequitable as it does 
not ensure that the correct proportions of £1 and los. tickets 
will participate in the prize. 

You are asked briefly to consider these criticisms and to advise whether 
the method is fair in all its aspects. 

41. A school contains 330 children, all of whom were born in the 
eleven years 1920-1930. Four times the total number of children bom 
in any group of five consecutive years is equal to five times the number 
born in the first and last years of the group p/us ten times the number 
born in the middle year of the group. The total number of children 
born in 1924, 1925 and 1926 is three less than three times the number 
born in 1925. Find the number born in 1925. 

42. Show that 

I 

approximately, where h is the interval of differencing. 

43. Establish Hardy’s formula “39a”; and use a Hardy formula to 

ri 

find the value of J (i +x-)-^dx correct to four places of decimals. 

44. A and B play a match of seven games. A ’s chances of winning, 
drawing and losing any game are as 5:3:2. One point is scored for a 
win and half a point for a draw. Find the chance that the match is 
drawn. 


45. Prove that ^2 + K3- ••• where 

is a real positive quantity which diminishes as n increases, and 
Lt u„ = o. 

In the series i— i + i — J + ... find the value of A’’u„ and prove that 
this series is equivalent to the series 


1 

2 


I 1.2 1.2.3 

I +- + - + , 

3 3-5 3 S -7 


46. Dse the conception of finite differences to prove that the general 
term in the recurring series U(, + «,* + u2x:‘^ + W3.r3+ ... (scale of relation 
I —px — qx') is of the form Aa^ + Bb”, where a and b are functions of 
p and q, and A and B are constants. 

Prove that every series whose coefficients form an arithmetical pro- 
gression is a recurring series, and that the generating function is 


a + {d—a) X 


where a is the first term and d the common difference of the progression. 


F U A s 


31 
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47. A bag contains four black balls and eight white balls. Two balls 
are drawn at a time and replaced, this operation being performed six 
times. Calculate the probability that two black balls are not drawn four 
times consecutively. 

48. Obtain a formula for the finite integration of any rational integral 
function of x and apply it to find the sum to n terms of the series whose 
rth term is (r'^ + 1) (r — 2). 

49- «26:3o= 16-3^1: ^30:30 = 15784; a2fi:8s = iS'66o; 

<*35:30 = I4‘42° ; <*25:40= *4'824; <*3o;s6= 

Find as accurately as possible 0^:33. 

50. A and B toss a coin in turn, a head counting two and a tail one. 
The winner is the person who first scores a total of exactly three. If 
either tosses a head when his score is already two, his score is reduced 
to one. Calculate yl’s chance of winning the game if he has the first toss. 

51. By successive approximations based on the values of ac* — 5X+3 
when x = o, i, 2 and 3, find to two places of decimals the smallest positive 
root of the equation - 5* + 3 = o. 

52. Two points are chosen at random on the circumference of a 
circle of radius r. Find the chance that the length of the chord joining 
them is less than r v/3- 

53. A die whose sides are marked, i, 2, 3, 4, 5, 6 is thrown five times. 
Find the probabilities: 

(a) that the product of the five throws is 432 ; 

(b) that the sum of the first three throws is exactly three more 

than the sum of the last two throws. 

54. Complete the series to «,5 by means of Everett’s formula ; 

x -5 o 5 10 15 20 25 

Mj, 6i-o 91-4 ii3’6 *34'2 *79'4 2380 2962 

55. The probability that A will die within ten years is -2 and the 
probability that A, B and C will all be alive ten years hence is 42. The 
probability that at least one of the three will be alive ten years hence 
is *985. Find the probability that A and B alone will be living at the 
end of the tenth year. 

56. Explain clearly the meaning of the term dispersion in statistics. 
What are the most usual methods of measuring dispersion? Indicate 
the advantages and disadvantages of these methods. 



MISCELLANEOUS EXAMPLES 


323 

57. A and B throw for a certain stake, each throwing with one die; 
/I’s die is marked 2, 3, 4, 5, 6, 7 and B’s i, 2, 3, 4, 5, 6. Prove that ^’s 
expectation is 47/72 or 21/31 of the stake, according as equal throws 
divide the stake or go for nothing. 

58. Let <^(*1), <A (*j), <^ (•*«) he the values of a function 

corresponding to n equidistant values of x distributed over the range 
b — a. Define the mean value of (x) over this range (i) as a limit, and 
(ii) as a definite integral. 

Extend your definitions to a function </> (x, y) of two variables x, y 
for the ranges b — a of x and ^ — a of y. Hence find the mean value of 

gX+u 

(e^'F-r(7^Te^<') 

foro^x^i, o^y^i. 

59. Four variables x, y, z and u are connected by the equations 

1 oojy = I + Qx*, 

I02r=:2 + 7y'*, 
m = 3 + 5~- 

If you were told to construct a table showing the value of u (but not 
the values of y and z) corresponding to each integral value of x from 1 
to 20, how many values of u would it be necessary for you to compute 
by means of the above equations before you could fill in the remaining 
values by a finite difference method? 

Write down the formula by which you would calculate these remaining 
values. 

60. In an examination the numbers of candidates who obtained 
marks between certain limits were as follows: 


Number of marks 
o-tq 
20-39 

40-59 

60-79 

80-99 


Number of candidates 

41 

62 

65 

50 

»7 


Estimate the number of candidates who obtained fewer than 70 marks. 

61. Explain Sheppard’s rules. 

Use the rules to express in terms of 

(i) «o, Amo, A*m_i, A*m_„ A«m_„ A®u_s, A««_,; 

(ii) Mo, Am_i, A*m_j, A»m_„ A*m_i, A'‘«_„ A*i/.,. 


ir-a 
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62. (i) Explain, giving a simple example in each case, the meanings 
of (a) a probability depending on mutually exclusive events, (b) expecta- 
tion and (c) probable value. 

(ii) If two dice are thrown and six coins are tossed, find the probability 
that the difference between the numbers shown by the dice is equal to 
the number of heads shown by the coins. 

63. (i) In a straight line AB of length a, two points X and Y are 
chosen at random. Using a geometrical method, find the probability 
that XY exceeds a length b (where b<a). 

(ii) If X is first chosen at random in AB and 1’ is then chosen at 
random in AX, explain carefully with the aid of a diagram why a similar 
method cannot be used and find the probability in this case. 

64. The equation x^ + 2x — 2o = o has a root between 2-4 and 2-5. 
Determine the value of this rcxrt correct to four decim.il places by a 
method of inverse interpolation. 

65. A large army consists of men between ages 20 and 40, the number 
at age x being proportionate to a + bif, where a, b and c are constants. 
If the numbers at ages 20, 30 and 40 are proportionate to 100, 68 and 20 
respeaively, find, correct to one decimal place, the average age of the 
men in the army. Given 

logic 2 = -301. logic 3 = '477. and logw e = -4343. 

66. If three whole numbers are chosen at random, what is the chance 
that their product is a multiple of 27? 

67. The winner of a game is the one who first scores four points, with 
the proviso that if two plavers score three points, the game continues 
until one player has scored two points more than the other. A’s skill is 
to B's as 2:1. Find A's chance of winning the game if he ow^es one 
point and B receives a start of one point. 

68. (i) Prove that ■ ^2^ (interval of differencing h). 

(ii) If be a function of the form 

6, * + + *3*4 + . . . to infinity, 

show that it can be expressed in the form 

I -*^(1 -*:)*^(l 

69. On March ist a plant 6 inches high was put into a greenhouse. 
On March 8th its height was found to be 10 inches, and on March 29th 
14 inches. Estimate its height on April 12th 
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(i) assuming that the height (h) is a rational integral function of 

the time (t), 

(ii) assuming that k and t are connected by a relation of the form 

h~a + bt^ ; 

and explain why one method should give a better answer than the other 
in this particular case. 

70. The following data are available: 

Age X 32 37 42 47 52 57 

''j- 3.V36 33'25 30'72 27-23 23-16 19-11 

It is desired to obtain with as little labour as possible, and it is 
suggested that 18-71 would be a reasonable approximation. Do you 
agree with this.? Give reasons. 

From the above data, obtain a value for <>44^. 

71. A die is thrown repeatedly until the total of the numbers turned 
up reaches 5 or more. What is the probability that one of these numbers 
is a 6.? 

72. If Uj. be a fujtction whose differences, when the increment of * 

is unity, are denoted by &u^, ... and by A«„ ... 

when the increment of is n; then if g^+i, •••are in geometric 
progression with common ratio </, show that 

AUj — _ B-Uj. 

73. Prove that, approximately, 

I2.Wo = [.S?(«0+A« 2-Atti). 

74. Two men throw for a guinea, equal throws to divide the stake. 
A uses an ordinary die, but B uses a die marked 2, 3, 4, 5, 6, 6. Show 
that B thereby increases his expectation by 5/i8ths. 

75 - A bag contains five counters marked 1 to 5. One counter is 
drawn at a time and replaced. What is the average number of draws 
required in order that counter No. 5 may be drawn twice? 

?(>• If = 

“10 = 27. 

“20 = ^2, 

find the value of x satisfying the equation 

+ Mio+e + “»+* = (“io-» + “20-* + “ao-i) — 42- 
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77. The goals scored by two teams A and 5 in a football season were 
as follows : 


Number of goal*? 
scored in a match 

Number of matches 

A H 

0 

27 17 

I 

9 9 

2 

8 6 

3 

5 5 

4 

4 3 


By calculating the coefficient of vunatiun in each case, find which 
team may be considered the more consistent. 

78. Find Uj;] from the following table of using all the values 

given : ^ 

/ ' ■ ' 

0 12 
To 16*25 15*08 13*49 

> 5-55 14-58 1315 

1 2 14-48 13-74 

79. During a certain month of the year it is reckoned that if at any 
time the weather is fine the odds are four to one against its breaking up 
within a week. At the beginning of this month, while it is fine, a gardener 
applies to his lawn a weed-killer for the success of which it is essential 
that the weather should remain fine for two days but should break up 
within fifteen days. Find to two places of decimals the chances that the 
weed-killer is successful. 

80. A match consisting of a maximum of five games is played between 
A and B. The chances that any game is won by A, won by B of drawn 
are equal. Each player scores one point for every game he wins and half 
a point for every game draw-n. The mat^h ends as soon as one of the 
players has a sufficient lead to leave him with an excess of points over 
his opponent even if the latter were to win all the remaining games. 
If a large number of such matches were played, what proportion of 
them would you expect on the average to end with the third, fourth 
and fifth game respectively, and what proportion would you expect to 
be drawn ? 

81. If is a rational integral function of the fourth degree in x, and 
if u^ = U2 = «] -I-M3==U]M3+ I =M4 + 2 u, = 0, and Mj is negative, find for 
what values of « greater than 4 

n - I 

6(n-2) S i)(n- 4) 

XmQ 
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82. If / (x) is a rational integral function of the third degree in x, 

and <i> (x)=f {x)+f (t — x), show that f (x) dx can be expressed in 

Jo 

t 

the form S f (x) -{ atj) (o) + bJ) (i) + cd> (2), where a, h and c are in- 
dependent of /, and find the values of these constants. 

83. Four people A, B, C, D and twelve others are to be seated at 
random at a round table. Find the chances that; 

(i) these four people are arranged in two separated pairs, A and 

B together, and C and D together, 

(ii) all four are separated from one another. 

84. Explain the difference, if any, between Aj,A, and A^.^. 

Find <144:5], given 

'*4«:o5 = 9'79^ ‘^4o.«o = 8'553 

^16:50 = I°'59^ ‘^45-66=^9‘5^3 

'^S0:60 = I°OS9 

85. A certain type of tag consists of a “ bootlace” with a cylinder at 
one end into which the tag at the other end fits. If any number N of 
exactly similar tags be held in the middle so that the cylinder ends 
hang down at one side and the tag ends at the other: 

{a) what is the chance that if, say, n tags be fitted into n cylinders’ 
at random, both ends have been chosen from the same “ boot- 
lace”, so that n loops are formed? 

{h) if all the N tags be fitted into all the .V cylinders, what is the 
chance that one large loop is formed i’ 

86. Three metal discs are numbered i, 2, 4 respectively on one side; 
the other side of each disc is blank. The discs are tossed three times, 
and the numbers showing up are added. A is to win a stake from B 
if the total is 8 to 13 inclusive, while B wins if the total is less than 8 
or more than 13. Find the odds in favour of B’s winning. 

87. In a certain town the post office has four telegraph boys who 
take it in turns to deliver telegrams. If a resident in the town receives 
telegrams on n separate occasions, what is the chance that they will be 
delivered by at least three of the telegraph boys? 

88. A player throws two dice simultaneously and is to continue 
throwing them so long as no figure less than a 5 turns up. If two 6’s 
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turn up simultaneously is to receive a prize amounting to on the 
first occasion it happens, on the second occasion, on the third 
occasion and so on. If two 5’s turn up simultaneously he is to receive 
£2 on the first occasion, £ 2 '‘ on the second occasion, ^3® on the third 
occasion and so on. 

Find the value of his expectation. 

89. If interpolated values are found in the intcri'al a; =o to a'=i 
from the values m_j, Wq, «j, u.,, by means of the formula 

t/j, = f !/u + — 

+ - (where 1 - .v), 

and in the next interval a — 1 to a: = 2 by the corresponding formula 
based on the values f/j, u., and show that: 

(1) The given values «q, i/j and m will be reproduced by the inter- 

polation. 

(2) The two interpolation curves have the same differential coefficient 

when x=i. 

(3) The interpolated values for wj and agree with those given by 

the ordinary third difference interpolation formula based on the 
same values of t/j. 

Given the following values: 

n_6 = 1 000. Wio = zbop, 

«o=i 403 . *<16 = 3487. 

«o= '931, 

complete the table for unit intervals from to U]„ by the above formulae 
and calculate the value of the differential coefficient of the interpolated 
curves when * = 5. 

90. A bag contains 10 counters numbered i to 10 respectively. A 
counter is drawn at random and replaced in the bag ; this is done three 
times, the numbers on the counters so drawn being added together to 
give a total. The process is repeated as many times as may be necessary 
to obtain three totals each of which does not exceed 12, any series of 
three drawings which produces a total exceeding 12 being ignored. 

Find the chance that these three totals will (if arranged in ascending 
order of magnitude) form an arithmetical progression of common 
difference 2. 
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91. Having given the present value of per annum at the end of 
20 years at the undermentioned rates of interest per cent. : 


Rate per cent. 


2 


3 

3 i 


4 

4i 


Present value of per annum 

16-351433 

15-589162 

14-877475 

14-212403 

13-590326 

13-007936 


use Everett’s formula of interpolation to find the present value of 
per annum at the end of 20 years at 3-2 per cent, per annum. 

92. The estimated distribution of cinema admissions in a certain j ear 
was as follows : 


Price of admission 

Number of admissions 

in pence 

in millions 

2 

13-6 

3 

74-4 

4 

52-2 

6 

272-5 

9 

164-7 

12 

193-2 

15 

53-8 

18 

91-1 

20 

47-8 


Give a sketch of a frequency distribution, showing the median and 
the mode. 

93. If .V is a whole number chosen at random between i and 99 
inclusive, find the chance that x(ioo-a:) will exceed one-half of its 
maximum possible value. 

94. A floor is ruled with equidistant parallel lines, A rod, shorter 
than the distance between the lines, is thrown at random on the floor. 
Prove that the chance that it falls on one of the lines is zc’rra, where c 
is the length of the rod and a the distance between the parallels. 

95. If in an examination six men are bracketed, the extreme difference 
between their marks being 6, find tlic chance that they have all obtained 
different marks. 
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96. The xth term of the series i, 2, 17, 72, 243, 754, ... is of the form 
a-^bx + c^ + d*. Determine a, b, c, d and find the sum of n terms of the 
series. 


97. A team of eight men is engaged in a practice shoot. Each man’s 
probability of hitting the bull on his target with any one shot is f. 
Each man fires one shot; then those who have not hit the bull fire a 
second shot, and so on, the unsuccessful men at each trial firing again. 

What is the minimum number of trials that should be made in order 
to ensure an even chance of all the men hitting the bull? 

98. In a certain game the player throws a ball on to a board con- 
taining twelve holes into one of which the ball is bound to fall. It is 
equally likely to fall into any one of the holes, which are marked 
o, o, o, o, I, I, I, I, 1, 2, 2, 2 respectively. If it falls into a hole marked 
0 it is “lost", but if it falls into any of the others it is returned to the 
player, who continues to throw the ball until it is eventually “lost”, 
when the game ceases. At each throw the player scores the number of 
the hole into which the ball falls. 

Show that the probability that the player obtains a total score of n 
is the (fl-l-i)th term of a certain recurring series and hence find this 
probability. 

99. A and B both wish to journey from A’ to Z by tram. A decides 
to take a tram which travels direct from X to Z in 55 minutes, while 
B decides to go by a different route, taking first a tram which travels 
from AT to y in 20 minutes and then another which travels from 1 ' 
to Z in 30 minutes. Trams run from X to Z, from X to 1 ' and from 
y to Z at intervals of 10 minutes, 7 minutes and 7 minutes respec- 
tively, but the times of running are not known. If A and B arrive 
at the same moment at .Y to wait for their respective trams, find the 
chance that A arrives at Z before B. 

100. If tV_i = U_T + U.^ + U_^ + U 4 + U_3, 

«J, = U , 4 M_ j -I- Hq -I- W, + 1 / 2 , 

W, = U3 4 4- Mg + U, , 

K ’2 = “8 + “» + «10 + «11 + « 12 . 

prove that 

«2 4- U3 = -2 (Wp 4 - re,) — -032 4 - A^p). 
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Examples 1. 

58. 2 . 30, 42. 3 . 15. 

*• i‘9- 6- 1110. 8. 6aA*. 

9 . J (- + 2S2X>- 1051 1344). 10. abcd.ia'.. 

O® — 2 

12. (i) Jx(.V-l) + l!, iu) c^l{c—l) + k, 

(iii) 3x (.X — I ) (.X — 2) + Ifx f.x — 1) + 3X + where k is a constant. 


13. 


14. 

16. 

20 . 

21 . 


- ■> . 

(x + 2 )(x+ 3 ) (x + 3)(x + 4)’ 

4- - -+ - J 

(x + 2) (.r + 3) (x + 4 ) (x + 3) (x + 4 )(x + 5 )' 

—7 — , — .. 15 . —2 or too. 

.r(x-i)(x-2) 

(i)an!; ( 2 ) i)"- 18. 55 . 19. «»* + (a* + i)* o‘’. 

Jx (x - I ) (X - 2) (,V - 3) + 2.Y (x - i) (x - 2) + ‘:]x (x - I ) + I 2X + ^. 

222 S- 22 . 20. 23 . -161. 24 . 229. 25 . 1261. 


28. x(*>-6.v(’) + i3.x(*> + .r<’> + 9: 4 x<’*-i8x(=> + 26 .v") + i; 

I2.r^*) — 36 .v'‘* + 26; 24v<'* — 36; 24. 

29. (i) (771 4- i) m (tti — i) ... (w — n + 2 ) a'"’*'' ; 


(ii) ( — i)" (771 + i) (77; + 2) ... (777 + 71) {h 'a — i + 

30. 2 cos (x + sin ii; sin |cos (x + i) cos .v} ; i — 2 sin (x + sin 

31. 6.v; 6 '(.v+l)’. 42. y‘ + 4ay=j3’. 43. 2 '.v — 2)" — 2 (x — 3)*. 


1. 465. 

4. 182; 343. 

7. 128. 

10 - 97 .. 357 - 

18. 69,215. 

16. 30; O. 

19. 5281 ; 6504. 

22 . 


Examples 2. 

2- 441; 653. 

5 . 5414. 

8- 94 ; 396; 662. 

11 - 844 : 746 . 

14. 237223. 

17. 14-73658. 

20. 5479. 


3. 300. 

8. 89,920; 89,073. 

9- 194-3; 279-9- 
12. -98127. 

15. --432; - 338; --196. 
18. 3-708; 2-711. 

21. 2153; 1705. 


***J.1J 

2459; 2424; 2359; 2268; 2153; 2018; 1868; 1705; 1534; 1357; 1180 
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23 . -017; -035; 052; ‘070; -087; -104; •122; -139; -IS?- 

24 . 23-1234; 23 2039; 23:2914; 23 3865; 23-4898; 23-6019; 23-7234. 

26 . i-ooo. 27 . -020660; 020625; -020628. 28 . 58,835. 

29 . 1; 2-10; 3-31; 464; 611; 7-73; 9-51; 11-47; i. 3 ' 62 ; iS' 97 - 

31 . 117-7; 114-2; 110-5; 106-7; 102-7; 98 6; 94-3; 89-8; 85-2; 80-3; 75 4. 

32 . -24928. 33 . Third degree; 275. 34 . 459. 

38 . ti, = 2i8; M4 = o; 115=— 19; 1876— 1429.V+ 360.V* — 30.V’. 

Examples 3 . 


1. 

.S 74 ,-;- 

2 . 47,983. 3 . 2-8169. 

4 . 1-7243. 

6. 2300. 6 

7 . 

1 + m 

Im + mil -f 111 Imn + muh + nf>l 4 plm 


r-m* • 

Pm'n" ’ 

piirirp^ 


8. 

13-18. 

6. 14-942. 

10. 20-43. 

11. .162. 

12 . 

659 + 22i.-c + ^ '_x^ — 

13 . 32. 

16 . I ; 25 

18 . 

33 and 67 to the nearest integer. 


19 . 37-2. 

20. 

7 - 37 - 

21. 130.326. 




23 . (.^-4); (.v-3) (.v- 7 '); (X-3X.X-4) (.Y-7)-, ,-r (.v-3) (.v-4) (.■<- 7 V 

24. The ilii/s are u-, -, 1 /,. 

3 3.(1 4.;(.u 4. 3,u, n 

Examples 4, 


1 . 33. 2. 6. 3 . 47,692. 4. 3251. 5 . 16-9216. 

6. 2-85805; 2-86305; 2-86157; 2-86155. 7 . 2017. 


8. 

35283. 

9. 

2196, 2108, 2022, 1939; 

1786, 1718, 

1657, 1604. 

10. 

01625. 

11. 

-3165. 12 . 2290-1 

14. 4-034. 




Examples 5. 



1. 

471-5; 2-7. 

2. 

13-3. 3 . 2-019.. 

.; 2018 


4. 

431 - 

5 . 

8-34. 6. 2-751. 

7. 

16-9. 

8. 

1-1576.... 

9. 

I-2134. 10 . 45-70. 

11. 

1-85. 

12 . 

3-091. 

13 . 

3-667 per cent. 14 . 

1 - 3713 - 

15 . 37 - 2 . 


Examples 6. 

1. iw ( — 2«* + 27n+ 17). 2. Vi” (”+ 1) ( 3 «* + 7 n + 2)- 

3 . —4195. 4 . 2*' + 628. 8. 3" + ' + i (■«’ + 7r; - 6). 

*• i (i (3"- + iTi+ I) ( 2 n + l)}. 

7. 2**+* — 2- JA (2A-f 1) (4^+ 1). 8. i (n*- ion’ + 29 k’+ ion). 
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9 . 2«‘+ i6«® + 47«“ + 6on. 10. z’* — 2095. 

11- i (« + 3) (” + 4 ) (« + 5) (” + 6)-90- 
12. Jn (w4- 1 ) (n + 4) (n+s;. 

13 - I'j {( 3 «- 2 ) ( 3 n+ i) ( 3 « + 4 ) (■’*: -r?) + 56}. 

14 . (n 4 - 1) (» + 2) (3W+ 13). 

16. ,}(,n {n+ i) (n + 2) (’ 9 ,<*+ 57 /i 4 - 137). 

16 . {f2n + 3) (2f> 5) (2« + 7) (2n + 9) (2n+ it)- 1039s}. 

17 - " 18 (•' 5 "+_' 3 ) jg n(n+i)_ 

4(«+4)‘ ■ 12 (m+ 2 ) (w + 3)‘ ■ 6(„ + 3 )(„ + 4 )- 

20 «(3n-r.s) 21 w(5” + J:0 

8 (3/i+ i) (311 + 4)" ■ 4(«+i){« + 2)' 

22. I'rti (?i-t I ) fn-t- z) (3/1^ + 3611+ loi). 


23 - i 2 ir-r 33 «-‘ 19 . I 9 

16S 6 (3W+ i) (3W + 4) (311 + 7)’ if)8‘ 

24 . H (n + I ) (31)^ + 3I« + 74). 

25 . i (n-t-3) (ii-*-2i 0!+ i) n, ]n (n— i) (m — 2) (« — 3). 


26 . 3'*— 1— n. 


27 - 2 l 'vi_ +“(‘’+01 . r 

/Z— J .1 tt — l ((2 — J)2 ) 


28 . I — 3 («+ I ) ' + (n + 2)!. 

29. il- {( 3 H- i) (311 + 2 ) (3" + 5) (3'»+i^^ (3”+ ii) + 88 o!. 

30 . in (6«''+ i6«’+ 9« — 4). 31 . n ( 7 /i’— 34n= + 89n — 254). 

38 . (n+ i) (9n'“+ i7n + 4): {2"+M«’- 311* + ion - 14)] + 28. 


34. 3 

3 3 I n + 2 1 

37 . (x-2)3-. 3 

{n + 2)\’' — 2 >■"+’ — V 

~.^-I (,V-I)^" 


35- 36, 34-(4n'< + I 2 n+ i7)/2'‘"', 

38 . .5 (3"- i)+ 1+ (4211+ i7n= + n*). 

40 c 12^ '’ + 3 6 ^’ + 2 8 v + 3 

I 2 V (.V+ I) (v+ 2 ) (v + 3 )’ 


1 139 36/1 + 39 \ 

54 1 10 (3"+ 2) (3/1 + 5)/ ■ 

2 — (n + I Hn + 2) ^ 


43 . 23. 


44 . n*2"'''*. 


+0 +('i -:vp 0 


“• ty'lr 

S + A (2”- 0 (2/1+1) (2/1 + 3) (24/1* + 54/1 + 25). 

48 . jn.v*- ( J<7 -Ji) .r^ + (ia— ib + ^c) .x‘ + (^h — ic + d) x + C. 


49 . (i-.x)’^*'. 


52 I, (n+i)' ) 

a-il (a+ i) (a + 2 ) ... (a + /j)/ 


63 . 2"'‘/i (n+ i) (/i* + 5n — 2). 
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Examples 7. 


2. 

20-796875; -sV- 



10. 

-02455; —-0003; 0. 

11. 

•109. 

13. 


16. 

48; 24. 

10. 

5*254. 

22. 

13-094. 

23. 

S-3I9- 


a*. 3 9634. 86. 10-389 ; 10-475. 

Examples B. 

2. -99ZI. 3. 20'4 miles. 

*• ih (9S«^)-SO“i + 6ooui-350Uj-f42SM4). 

fl. 200 square inches. 11. -8571...; (i) -8806...; (ii) -8946.... 

12. The second. 13. 29,426. 14. 888. 16. -6942. 

22. -5236. 24. 3142 26. 11358.... 

26. -6931.... 27. 3142 

Examples 10. 


2. 

4: i. 

4. 


5. 4. 

6 

0 


7 . 

T 

8. 

A* 


10. 1 
n! n! 

(i)“ 

11. s'l 

• 


12. 


la. 1',. 14. 

ill 



16. 

15 ><35’ 
6'» ■' 

17. 

(a) js"; 

w w —y 

L)_ 

18. 38???. 


19. 


20. 

77^3 
\ 1 (05* 

21. 1880- 


22. 

138 X 56 

S* ’■ 


23 

44- 


24. 

(l) 1SI^.T05‘> ( 2 ) K^A'oO- 


26. 4. 

27. 

i?Bi) 


28. ^ 

iVo 

20. 

44. 

31. (A)’. 


32. 1.5 

?; 


83. 

m- 


34. 

nm-, 

»6. 

hr 

iS- 

36. 


87. 

iVA- 


38. 

fj-l p^(p + 2q) . 

' (/>-+?) (p’-i-p? + <?’)’ 

(u) 

_P (P + 9) 

P*+P9 + 9’' 


80. 

I ; A 

; ? 

40. 

48?- 

41. mi 


42. j 

i'.3. 

43 

3M(: 
• -21 K 



44. 

- 126 . 

46. 2 X 10“' 

1 

46. (!) i; 

(2) 

ST . 
^00 • 

( 3 ) ti 


47. 

-105. 

40. -98304 or 

-09888. 

60. (I) 

- 72 ; 

(2) 

■504. 


61. 

(1) -099...; (2) -000007 

• • *» 

001 

62 

!. -2 

109. 



68. 

(a) -972; (6) -271. 


S4. il 

■i 





66. 


67. 7 

- 


68. ” + 3 

2Tn-4-l 





88. 

(I) A; 

(2) Ti8«: (3) *Wo 


84. mi 





Sft 

3"*-3« + 2 


log, 2 



67. 

L 


OD. 

9n*-9ii + 2‘ 

tog. (*- * 

ffl , 

^ ; log, 2. 
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68- (i) xh\ (2) A: (3) A- 
(3) i- 

(a I\ij 
76 . 


(i) rii (2) A: 

(4!r( i 3!r 

52! 


73 . 


70. m- 


71. 


3003 


an u 7 ^ 

lllri- 
i3 « 

89 , 

82. A"A. 


81. (,) 


77. A- 
r; (2) A; (3) li 


36«'“- i8 » + 2' 

78. •506.... 


2“ 

76. 


96- Ak%: yjss. 


84 . 3000 : 3007. 85 . J. 

90. ii’i;';. 91. (i) ( 2 ) f, ; 

93. (a) A; (fc) (c) A- 


86. J. 

( 3 ) I'il'o. 

95. AA- 


88 . 


1. 24-9., . years. 

4 . 321; 31-6. 

10. II; 6,?. id. 

12. 6j. 0(7. ; 5s. 0(7. 


Examples 11 . 

2. 64-65...; 64 55.... 

6. 33-34. 

11. ( I ) Between 5 and 6 ; 
13. 35J. 0(7. 14. 6: 


17 . (i) 631.0(7.; (ii) 56?. oJ. 


18. 


3. 77 (approx,). 

6. 7 9...; 9 0.... 

(2) 4; (3) 7. 

I. 16. Purse A. 
19. im(7n+i). 
22. Tip 4- Jw. 


20. 10. 21. (i) IS.; (2) i.rs.; (3) lis. 

23. (i) (2) (3) .Most probable number is 4 ; t-Iu’. 

24. 7s. 2(7. 25. £-13. 14s. ii(7.; ^'56. 5s. id. 


Examples 12. 

1. TIP = 509'6: (7=8 o. 2. 7VP=22-3; M,= i9-5i (7 = 92. 

3 . 9 2 and 6-2. 

■*. 35'3s. (it lilt- extreme values are omitted) or 347s. (if they are included). 

5. A7ii = 8-6, j' 17, = 7-9, Quartiles 10 and 6. M — ys. 

6. c7=:4-2. 7 . ±15111. 8. -27 year. 

9. M=toy, A/, = 57 3; CT= 14-25. 10. B is tlie more consistent. 

11. a= 14-7. 12. A' is the more consistent. 


Examples 13. 


8. i(n+2). 

6 . 7 

P* 

10. Ja*. 

14. i. 


3 . 


7 . W -, 


11 . 


i* 

a’" 

I 

1 3’ 


4 . log, 2. 8. 

ft 

„ , . 2? - , , r*Tr 

8, (i) ; (2) . 

71 4 




(V2-I). 


15- Ji- 


16 . i. 


»2. A«’- 

17 . i. 


13 . 


^log 2-3^ 
64 log 2 - I 5 ■ 


18 .J. 


19 . 


ab' 
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20 . f-Jlog. 3 - 21 . -. 22 . J. 24 . i + ilog.2. 

4 

20. i?: JJ miles. 26. J ( 2 - v 2) + log (i + Vz)- 


Miscellaneous Examples. 


1- + 

8 . 40 - 3 --.; ITT- - 

6. (a) 19,764; (fc) 59,156. 


6. iHii- 

11. (1) i?,; (2) .V 


h- 1’^- 977.961:1023. 18 . 

•0294; ‘0492; -0682. 22 . 1.109. 24 . 38-45; 9 4 

«o + «Ami + in (n - 3) Atij + (n - 4) (n - 5) A'n, . 


26. 

loio- 

27. 

3^ 

16 ‘ 

28. 

- 1-88. 

29. 


(si)* iVs 

30. {a + \a log. 

3- 

34. 

2^- 

36. 

58 9321.... 

36. 

86; 121. 

37. 


38. 

20,536. 

39. 

ii- 

41. 

45- 

43. 

-7854- 

44. 

■ 109 — 

47. 

-/r- 

49. 

15-975- 

60. 

l 

51. 

■66. 

52. 

i- 

63. 

180 780 

6‘ ’ 6‘ ■ 

64. 

113-6, 1170, 120-4, 124-2. 128-8, 134-2, 

141-4, 

1494, 158-6, 

56. 

179-4. 

-14 or -18. 

58. 

il, e>+i7 

4L'»» ' z^l- 


60. 196. 

62. 

(ii) -TaV- 

63. 

(i) (a — by . ' a *. 


64. 2-4695... 

66. 

28-0. 

66. 

Ih 


67. i‘g. 

•9. 

(1) 10 in. (ii) 

15-8 in. 

70. 29-08. 



71. 


75, 

10. 


76. 3. 


77 . B is the more consistent. 78 . 12-50. 

70 . -32. 80 . ./V. s'V. it; li. 81 . 7,13. 

82. a=-lh=^hc=-.h. 83 . (,) ,|S,; (.i) ??. 

{ iV — w ) ^ I 

84 . 10-476. 86. —jvi • ; Y- 86. 15:17 


10-476. 


86. IS : 17. 


-- (4"-6.2"4-8). 

4 


88. 155. 


1403, 1498, 1598, 1703, 1814, 1931, 2054, 2182, 2316, 2458, 2609; 120-6. 
81 - 14-606063. 93 . iJ. 


3^. 88- H2"** + 3"+‘-7«’-«-7}. 


88 . + (-*)". 


89. sVA. 
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Addition rule, 216 
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Aitken, A C., 19, 40, 119, 158. 161, 
165, 169 

Aitken’s notation, 41 
theoiem, 158 
Algebraic series, 112 
Alison, b H , II 
Approximate integration, 173 
Approximation, successive, 89 
A priori defimtion of probability, 
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Argument, 3 
Arras , 234 
Averages, 258 

Backward fonnula. Gauss’s, 64 
Bemouilli, 270 
Bernouilli’s numbers 189 
Bessel’s lormula, 64 171 
Binomial curve, 290 
Binomial theorem, 12, 26 
Boole’s Finite Differences, 19 
Buchanan, J , 139 150, 153, 197 

Genual differences, 61 
Chance, 210 

Change of origin, 32, 178 
Ch inge of scale, 32 
Change of unit, 177 
Class-intirval, 256 
( lass range, 257 
C oefhcient of variation, 287 
Coefficients, detached, 21 
Collins, John, 30 
Compound function, 132 
Compound probability, 218, 221 
C omrie, L J , I 7 t 
C ontict of the first order, 148 
Cm Illation table, 213 
Cross-means, interpolation by, 165 
linear, 165 
quadratic 168 

D’ \lembert, 207, 270 
Dc Morgan, 40, 124, 270 


De Morgan’s Differential and Integral 
Calculus, 193 
Dependent events, 221 
Dct iched coefficients, 2 1 
Difference table, 3 
Differences, adjusted, 56 
central, 61 
definition of, 3 
divided, 39 
leading 4 

Differences of zero, 123 
Dispersion, 279 
Divided differences, 39 
Double integrals, 301 
Duration of play, 240 

Lldcrton's Frequency Curves and 
Correlation, 197 

Flimination of third differences, 90 
Entry, 3 
Euler, 52 

Euler-Maclaurin expansion, 188 
Everett’s formula, 66 
second formula, 67 
Expectation, 267 
Expected value, 268 
Extrapolation, 38 

Factorial notation, 19 
Finite differences, 1 
Finite integration, 107 161 
Forward formula, Gauss’s, 63, 65 
Fraser, D C , 30, 40, 46, 56, 72, 115, 
118, 142 145, 152, 153 
hexagon di igram, 142 
Freeman’s Actuarial Mathematics, 19, 
40 

Frequency, 255 
Frequency curve, 239 
Frequency distribution, 257 
Function, compound, 132 
rational integral, 1 
two variable 135 

Gauss’s backward formula, 64 
forward formula, 63, 63 
Geometrical solutions, probability, 308 
Gregory, James, 30 
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Hall and Knight’s High&r Algebra, 102 
Hardy, G. F., 119, 184 
Hardy’s formulae, 184 
Henderson, K., 153 
Henry’s Calculus and Probability, 40 
Hexagon diagram, 14a, 145 
Histogram, 256 
Huyghens’s problem, 227 

Indefijiite finite integral, 107, 163 
Independent events, 218 
Induction, method of, 237 
Integration, approximate, 173 
by parts, 161 
Interpolation, 26 

by cross-means, 165, 168 
formulae of, 29, 44. 51, 63, 64 
inverse, 81 
osculatory, 147 
Intertals, equal, 26 
subdivision of, 33 
unequal, 39 

Inverse interpolation, 81 

Jenkins, \V. A., 15J 

Karup, Prof., 150 
Kerrich, J. E,, 153 
King, A. E., 197 
King, G., 150, 185 

Lagrange’s formula, st, 137. 139, *98 
Leading differences, 4 
Leading term, 4 

Lidstone, G. 46, 67, 72, 117, 119, 
13!, 152, 159, 161, 162, 163, 171 
Linear cross-means, 165 
Lubbock’s formula, 191 

Maclaurin’s series, i6o 
Mean, 259, abt 
Mean deviation, 280 
Mean value, 296 
Median, 259, 264 
Milne-Thomson, L. M., 19, 40 
Milne-Thomson’s Calculus of Finite 
Differences, 199 
Mode, 259, 265, 270 
Mortality tables, 229 
Most probable value, 270 
Multiplication rule, 219 
Mutually exclusive events, 216 

Neal, P. G., 150 
Newton, 56 


Newton's formula, advancing dif- 
ferences, 29 
divided differences, 44 
Normal curve of error, iyo 

Ogive curve, 2(15 
Operators, A, F, 6 
S, 33 
A. 4 > 

8, 7(> 

E, 106 
D, A, 126 
V, 146 

e . H, 158 

A, A, I bi 

Origin, change of, 32, 178 
Osculatory interpolation. 147 

Parts, summation b> , 104 
Poincard, 270 
Polynomial, i 
Prize, 2t>y 
Probability, 204 

application of calculus to, 304 
geometrical solutions, 308 
Probable error, 289 
Probable value, 267 

Quadratic cross-means, 168 
Quadrature formulae, 185 
Quartiie deviation, 286 
Quartilcs, 286 

Radix, 229 
Range, 278 

Rational integral function, I 
Recurring scries, in 
Remainder terms, divided differences, 
57 

quadrature fomiulae, 198 
Reversion of scries, 81 
Rolle’s theorem, 56 
Root-mean-square deviation, 281 

St Petersburg problem, 270 
Scale, change of, 32 
Semi-inter-quartilc range, 286 
Separation of symbols, 16 
Series, algebraic, 1 1 2 
recurring, 1 1 1 

Sheppard, W. F., 40, 46, 47, 56, 72, 76, 
284 

Sheppard’s notation, 76 
rules, 45 

Simpson’s rule, 174, 181 



INDEX 


339 


Spencer, J., 138 
Sprague, T. B., 150, 193 
Spurgeon’s Life Contingencies, 186 
Standard deviation, 282 
Standard error, 289 
Statistical definition of probability, 208 
Statistics, 254 
Steffensen, J. F., 40 
Steffensen’s Interpolation, 19, 147, 199 
Stirling’s formula, 64 
Subdivision of intervals, 33 
Successive approximation, 89 
Summation, 98 
by parts, 104 
Summation n, 1 14 
Symbolic notation, 3 
Symbols, A, 3 
E, 6 
33. 76 

A. 41 
76 

X, 106 
D, 126 
V, 146 
9 , (■), is;8 
A, A, 1(11 

Symbols, separation of, t6 


Taylor’s theorem, 55, 126, 141 
Tchebycheff’s theorem, 284 
Term, leading, 4 
Thiele, 26 

Three-eighths rule, 181 
Throw-back device, 17 1 
Todhunter, R., 67, 119, 141 
Two variables, functions of, 135 

Unit, change of, 177 

Unitary definition of probability, 208 

Universe of an event, 217 

Variate, 254 

Variation, coefficient of, 287 
Weddle’s rule, 182 

Whittaker and Robinson’s Calculus of 
Observations, 72, 197 
Interpolation, 93 

Wliitworth’s Choice and Chance, 270 
Wickens, C. H., 197 
Woolhouse’s formula, 193 
notation, 79 

Zero, differences of, 123 
Zig-zag rules, 47 





